REPORT  DOCUMENTATION  PAGE 




AFRL-SR-BL-TR-00- 


1.  AGENCY  USE  ONLY  (Leave  blank)     2.  REPORT  DATE


4.  TITLE  AND  SUBTITLE 

NonLinear  Robust  Control:  Theory  and  Applications 


3.  REPORT  TYPE  AND  DATES  COVERED


1  April  1995  -  31  March  1998 


5.  FUNDING  NUMBERS 

F49620-95-1-0296


6.  AUTHOR(S) 

William  M.  McEneaney 


7.  PERFORMING  ORGANIZATION  NAME(S)  AND  ADDRESS(ES) 

Carnegie-Mellon University
Office  of  Research  Contracts 
5000  Forbes  Avenue 
Pittsburgh,  PA  15213-3890 


9.  SPONSORING/MONITORING  AGENCY  NAME(S)  AND  ADDRESS(ES) 

AFOSR 

801  N.  Randolph  Street,  Room  732 
Arlington,  VA  22203-1977 


8.  PERFORMING  ORGANIZATION 
REPORT  NUMBER 


10.  SPONSORING/MONITORING 
AGENCY  REPORT  NUMBER 

F49620-95-1-0296


12a.  DISTRIBUTION  AVAILABILITY  STATEMENT 

Approved  for  Public  Release. 


12b.  DISTRIBUTION  CODE 


13.  ABSTRACT  (Maximum  200  words) 

There are three main components to the work: Nonlinear Robust Escape, Nonlinear $H_\infty$, and Nonlinear Robust Tracking. For
the first, we use a numerical method based on generalized characteristics. An application to the compressor stall problem is
given. For the second component, we have developed a method for isolating the "correct" solution in the case of fixed
feedback control. This is extended to some active control cases as well. We have employed the generalized characteristics
method on the Aerospace Plane Ascent problem and verified that it does indeed attenuate the disturbances. For the third
component, we are generating software for Robust Tracking of nonlinear systems with state spaces of several dimensions.
There are some open questions regarding the numerics in this case.




15.  NUMBER  OF  PAGES 

33 


16.  PRICE  CODE 


17.  SECURITY  CLASSIFICATION  18.  SECURITY  CLASSIFICATION  19.  SECURITY  CLASSIFICATION  20.  LIMITATION  OF  ABSTRAC 
OF  REPORT  OF  THIS  PAGE  OF  ABSTRACT 




Standard  Form  298  (Rev.  2-89)  (EG) 

Prescribed  by  ANSI  Std.  239.18 

Designed  using  Perform  Pro,  WHS/DIOR,  Oct  94 


Abstract for F49620-95-1-0296


There are three main components to the work: Nonlinear Robust Escape, Nonlinear
$H_\infty$, and Nonlinear Robust Tracking. For the first, we use a numerical method based on
generalized characteristics. An application to the compressor stall problem is given. For the
second component, we have developed a method for isolating the "correct" solution in the
case of fixed feedback control. This is extended to some active control cases as well. We have
employed the generalized characteristics method on the Aerospace Plane Ascent problem
and verified that it does indeed attenuate the disturbances. For the third component, we are
generating software for Robust Tracking of nonlinear systems with state spaces of several
dimensions. There are some open questions regarding the numerics in this case.




Nonlinear  Robust  Control:  Theory  and  Applications 
Final  Report  for  AFOSR  Grant  F49620-95-1-0296 

William  M.  McEneaney 


1.  Introduction 

This  final  report  is  focused  on  methods  for  the  reduction  of  the  effects  of  unwanted 
disturbances  on  aerospace  systems. 

The approach taken was that of $H_\infty$, or more generally Robust, control. This is an
approach where one views the disturbances in the system as antagonistic to our goal and
attempts to attenuate their effect on the system. These disturbances are assumed to have
finite energy ($L_2$ processes). One may contrast this with the methods of stochastic control
where the disturbances in the system are random rather than antagonistic, but more importantly
are typically diffusion processes. Diffusion processes are driven by Brownian motion,
which has infinite energy. This contrast between Robust Control and control of diffusion
processes in terms of energy of the disturbances has important numerical ramifications. In
particular, the dynamic programming equations (DPE's) associated with Robust Control
are first-order while those associated with stochastic control are second-order. We make
use of fast numerical techniques for first-order equations which are not applicable to the
second-order case.

We dealt primarily with nonlinear systems. $H_\infty$ control was developed first for linear
systems in the frequency domain. Later, a state-space formulation was found. This state-space
formulation allowed one to generalize to nonlinear systems and to other paradigms
(such as Robust Escape). In the state-space formulation, Robust/$H_\infty$ control formulations
take the form of differential games. The corresponding DPE's are Isaacs equations (or in the
case of a fixed feedback control, Hamilton-Jacobi-Bellman equations). For many problems,
it has been shown that the correct weak solution of such a DPE is the viscosity solution.
The solution of the DPE's is at the heart of our control approach.

There are two philosophies regarding computation of controls. The first is to precompute
the feedback controls and store them in look-up tables which are read in real-time.
The second is to generate the optimal (or sub-optimal) controls in real-time by some reasonably
fast algorithm. Both approaches have merit. The latter approach has the advantage
that the control system may adapt to changing parameters in real-time, whereas this may
not be feasible for the former if the parameter space becomes too large. With nonlinear
control, the computation of controls in real-time has always been a critical question. We
used methods which may be used in the first approach (off-line) or for lower dimensional
systems in the second approach (real-time algorithm). It is a fundamental point that we
believe that these algorithms are applicable in real-time for appropriate nonlinear systems.
This allows us to overcome a hurdle which has often stood in the way of nonlinear control
and the dynamic programming approach to nonlinear control in particular. The algorithms
use a generalization of the concept of characteristics which is only available for first-order
PDE's (not applicable to stochastic control). The use of characteristics allows us to solve the
DPE locally rather than over the whole state space. This technique bears some resemblance
to the Pontryagin Maximum Principle, but is applicable to Robust/$H_\infty$ control problems.

The bulk of the work is broken down into three areas: Nonlinear Robust Escape Formulation,
Nonlinear $H_\infty$ Control, and Robust Filtering. For Robust Escape, we discuss
the current state of the theory, present examples and demonstrate some initial software
which has been applied to compute controls for the Compressor Stall problem. We then
discuss nonlinear $H_\infty$ Control in general, and present an application to Aerospace Plane
Ascent where we verify in a simulation that our algorithm does indeed produce an $H_\infty$
controller. Lastly, we discuss a Robust Filter. This work was initially started in response to the
partially-observed $H_\infty$ control problem. However, it quickly became clear that the concept
of attenuating disturbances in a robust fashion for tracking applications has several advantages
similar to those presented in standard Robust/$H_\infty$ Control. Since tracking absolutely
requires real-time algorithms, our approach appears quite fruitful.

Since  the  document  is  quite  long,  we  indicate  briefly  here  the  main  areas. 

Nonlinear  Robust  Escape: 

We are currently using a method of characteristics for Robust Escape software. Some
results for this method were obtained for the $H_\infty$ case but not for Robust Escape. An
application to the compressor stall problem is given.

Nonlinear $H_\infty$:

We have examined the possible multiple solutions to the DPE, and have developed
a method for isolating the "correct" solution in the case of fixed feedback control. This is
extended to some active control cases as well. We also have some results on the connection
between the characteristics and viscosity solutions for $H_\infty$ control. We have employed the
method on the Aerospace Plane Ascent problem and verified that it does indeed work.

Nonlinear  Robust  Tracking: 

We  are  generating  software  for  Robust  Tracking  of  nonlinear  systems  with  state  spaces 
of  several  dimensions.  There  are  some  open  questions  regarding  the  numerics  in  this  case.  In 
particular,  in  the  presence  of  very  high  quality  measurements,  the  state-space  components 
of  the  characteristics  diverge  rather  rapidly. 


2.  Robust/$H_\infty$  Control  of  Nonlinear  Systems

Robust/$H_\infty$ control techniques attempt to bound the system cost by some measure of
the disturbance size. The most common measure of disturbance size is the $L_2$ norm. To be
more specific, consider a system of the form

\[ \frac{dX}{dt} = f(X,u) + \sigma(X)w, \qquad X(0) = x \tag{2.1} \]

where $X$ is the state, $u$ is the control and $w$ is the disturbance. Let us specifically assume
throughout that $X$ takes values in $\Re^n$, $u$ takes values in some compact set $U$, and $w$ takes
values in $\Re^m$. One typically wants to obtain a control that bounds the cost to the controller
from above by some function of the initial condition, $x$, and the $L_2$-norm of $w(\cdot)$. In the $H_\infty$
set-up, this typically takes the form

\[ \int_0^T C_1|X(t)|^2 + C_2|u(t)|^2\,dt \le \gamma^2\int_0^T |w(t)|^2\,dt + W(x) \qquad \forall T \in (0,\infty) \tag{2.2} \]

where $W(0) = 0$. In this case, $W$ is referred to as a storage function, and $\gamma$ is referred to as the
disturbance attenuation constant. One wishes to find a controller $u^*$ such that (2.2) holds
for all $w \in L_2^{loc}$ where

\[ L_2^{loc} = \left\{ w : [0,\infty) \to \Re^m \;:\; \int_0^T |w(t)|^2\,dt < \infty \quad \forall T < \infty \right\}. \tag{2.3} \]

Note that in the absence of a disturbance ($w \equiv 0$), (2.2) implies that $X(t) \to 0$ as $t \to \infty$. (This
is referred to as internal stability.)

Another robust paradigm of significant interest (to be described more fully below) is the
Robust Escape Problem. In this case, there is some set $G$ that we would like to keep the
state $X(t)$ in as long as possible. In this case, assume $x \in G$. Let $\tau$ be the first time the state
leaves the set $G$. The robust criterion takes the form

\[ \tau \ge \frac{-W(x)}{\theta + \frac{1}{2\tau}\int_0^\tau |w(t)|^2\,dt} \tag{2.4} \]

where $\theta$ is a design parameter analogous to the $H_\infty$ disturbance attenuation constant $\gamma$.
Consequently, if the average power $\frac{1}{T}\int_0^T |w|^2\,dt \le P$, then we have the cost bound

\[ \tau \ge \frac{-W(x)}{\theta + \frac{1}{2}P}. \]

This is the escape form of the Robust/$H_\infty$ approach. This form is also quite similar to the
Finite Power Gain approach recently being considered by James and Dower [DJ].

There  are  other  systems  for  which  one  can  apply  Robust  techniques  such  as  nonlinear 
finite  time  horizon  systems  [Mcr],  and  Markov  Chain  models  [FHH],  [FGM]. 

In the following subsections, we explore Robust Escape theory and applications (Compressor
Stall), $H_\infty$ control theory and applications, and recent numerical methods for these
and other problems.


2.1  Nonlinear  Robust  Escape  Problems 




We  first  discuss  the  theory  including  examples  and  analysis  of  numerical  methods.  Then 
we  turn  to  a  specific  application  to  compressor  stall  control. 

2.1.1  General  Theory 

We  briefly  review  the  general  theory  of  Robust  Escape  control. 

Consider a system of the form (2.1) where $x \in G$. Suppose that $G$ (the set we wish to
keep the state in) is open and that the closure of $G$, $\bar{G}$, is compact. Suppose that $G$ satisfies
an exterior sphere condition, that is, for any $x$ in the boundary of $G$ there exist a $y \notin G$
and an $r > 0$ such that $B_r(y) \cap \bar{G} = \{x\}$ where $B_r(y)$ is the closed sphere of radius $r$ centered
at $y$. (Any convex set or set with smooth boundary certainly satisfies this condition.) The
Robust Escape problem may be formulated as a zero-sum, deterministic, differential game
where $u$ represents the control for the minimizing player, and $w$ represents the control for
the maximizing player. The payoff (which these players are trying to control) takes the form

\[ P(x,u(\cdot),w(\cdot)) = -\int_0^\tau \theta + \tfrac{1}{2}|w(t)|^2\,dt. \tag{2.5} \]

Let $\mathcal{U}$ be the set of measurable controls taking values in $U$. Let $\mathcal{W} = L_2^{loc}$ be the
space of disturbances. The Elliott-Kalton value of the game is used ([EK], [ES]). To be
specific, a strategy for the minimizing player (our control) is a mapping from $\mathcal{W}$ to $\mathcal{U}$ which
is non-anticipative. That is, $\phi : \mathcal{W} \to \mathcal{U}$ is a strategy for the minimizing player if for any
$t \in [0,T]$ and any $w, \hat{w} \in \mathcal{W}$ such that $w(r) = \hat{w}(r)$ for all $r \in [0,t]$, one has $\phi[w](r) = \phi[\hat{w}](r)$ for all
$r \in [0,t]$. This condition guarantees that the strategy for the minimizing player will not have
foresight of the behavior of the maximizing player. Let this set of strategies be denoted by
$\Phi$. The lower value of the game is given by

\[ W(x) = \inf_{\phi\in\Phi}\,\sup_{w\in\mathcal{W}} P(x,\phi[w],w). \tag{2.6} \]

An  analogous  definition  holds  for  the  upper  value.  If  the  upper  value  and  the  lower  value 
are  identical,  then  the  game  has  value  in  the  Elliott-Kalton  sense. 

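The Isaacs equation corresponding to this game is given by

\[ \theta = H(x,\nabla W), \quad x \in G; \qquad W(x) = 0, \quad x \in \partial G \tag{2.7} \]

where $\partial G$ is the boundary of $G$ and $H$ is given by

\[ \begin{aligned} H(x,p) &= \min_{u\in U}\,[f(x,u)\cdot p] + \max_{w\in\Re^m}\left[(\sigma(x)w)^T p - \tfrac{1}{2}|w|^2\right] \\ &= \min_{u\in U} f(x,u)\cdot p + \tfrac{1}{2}\,p^T\sigma(x)\sigma^T(x)p. \end{aligned} \]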
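
The closed form of the disturbance term in $H$ can be checked numerically. The following is a minimal sketch, not the report's software: the matrix `SIGMA_EXAMPLE`, the vector `P_EXAMPLE`, and the finite control sample are hypothetical values chosen only for illustration, and the minimum over $U$ is approximated by a minimum over that finite sample.

```python
def sigma_transpose_times(sigma, p):
    """Return sigma^T p for a matrix stored as a list of rows."""
    cols = len(sigma[0])
    return [sum(row[j] * p_i for row, p_i in zip(sigma, p)) for j in range(cols)]


def disturbance_objective(w, p, sigma):
    """(sigma w)^T p - 0.5 |w|^2, the quantity maximized over w in H."""
    s_t_p = sigma_transpose_times(sigma, p)
    return sum(a * b for a, b in zip(s_t_p, w)) - 0.5 * sum(a * a for a in w)


def disturbance_max_closed_form(p, sigma):
    """0.5 p^T sigma sigma^T p, attained at the maximizer w* = sigma^T p."""
    return 0.5 * sum(a * a for a in sigma_transpose_times(sigma, p))


def hamiltonian(x, p, f, sigma, control_samples):
    """H(x, p), with the minimum over U replaced by a finite control sample."""
    drift = min(sum(a * b for a, b in zip(f(x, u), p)) for u in control_samples)
    return drift + disturbance_max_closed_form(p, sigma(x))


# Hypothetical data, for illustration only (not taken from the report).
SIGMA_EXAMPLE = [[1.0, 0.0], [0.5, 2.0]]
P_EXAMPLE = [1.0, -2.0]
W_STAR = sigma_transpose_times(SIGMA_EXAMPLE, P_EXAMPLE)  # candidate maximizer
```

Evaluating `disturbance_objective` at `W_STAR` reproduces the closed-form value, and any perturbation of `W_STAR` gives a smaller value, since the objective is strictly concave in $w$.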

Suppose that $f$ and $\sigma$ are Lipschitz and that $\sigma\sigma^T$ is uniformly non-degenerate, i.e. there
exists $\mu > 0$ such that

\[ \xi^T\sigma(x)\sigma^T(x)\xi \ge \mu|\xi|^2 \qquad \forall x \in G,\ \forall \xi \in \Re^n. \]

Then the game has value, $W$, and this value is the unique continuous viscosity solution of
the Isaacs equation [DMc], [McD].

Let us clarify the robust interpretation of this game described above. Suppose, for
instance, that there exists an optimal feedback control for the minimizing player in this
game. This implies that

\[ W(x) \ge -\int_0^\tau \theta + \tfrac{1}{2}|w(t)|^2\,dt \qquad \forall w \in \mathcal{W}. \tag{2.8a} \]

Let $Q(x) = -W(x)$ and

\[ \mathcal{W}^P = \left\{ w \in L_2[0,\infty) \;:\; \frac{1}{T}\int_0^T |w|^2\,dt \le P \quad \forall\, 0 < T < \infty \right\}. \]

Then (2.8a) has the interpretation

\[ \tau \ge \frac{Q(x)}{\theta + \frac{1}{2}P} \qquad \forall w \in \mathcal{W}^P. \tag{2.8b} \]

This is a lower bound on the escape time as a function of the average power of the input
noise. It is directly analogous to the attenuation bound of $H_\infty$ control, as well as to finite
power gain control [DJ].
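
As a small illustration of how the bound (2.8b) would be used once $Q$ is known, the sketch below evaluates the guaranteed escape time for a given average disturbance power. The function name and the numbers in the usage note are hypothetical and are not taken from the report.

```python
def escape_time_lower_bound(q_x, theta, avg_power):
    """Lower bound Q(x) / (theta + P/2) on the escape time, as in (2.8b).

    q_x       -- Q(x) = -W(x) at the initial state (assumed already computed)
    theta     -- design parameter, must be positive
    avg_power -- bound P on the average power of the disturbance
    """
    if theta <= 0.0:
        raise ValueError("theta must be positive")
    if avg_power < 0.0:
        raise ValueError("average power cannot be negative")
    return q_x / (theta + 0.5 * avg_power)
```

For instance, with the hypothetical values `q_x = 2.0`, `theta = 1.0` and `avg_power = 2.0` the guaranteed escape time is 1.0, and the guarantee shrinks as the allowed disturbance power grows.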

We now provide some simple examples as a means of illustrating these concepts. (The
Compressor Stall application will be discussed in the next section.) The absolutely simplest
example is a one-dimensional system of the form

\[ \frac{dX}{dt} = u(t) + w(t), \qquad X(0) = x \in G. \]

Let $G = (-1,1)$, that is, we wish to keep the state between -1 and 1. Let the control take
values in the set $[-1,1]$, so that we are limited by 1 in the magnitude of our control effort.
The payoff is simply

P(x,u(),it'(-))  =  “ 

and  we  take  the  design  parameter  e  =  1.  Then  the  Isaacs  equation  (2.7)  takes  the  form 


i: 


( 0  +  u>2(0) 


0  =  min  [uWx]  +  ma x[wWx  -  \w2] 

=  -|W*|  +  \w2x 

\[ W(-1) = W(1) = 0 \]

which has the viscosity solution

\[ W(x) = 2(|x| - 1). \]


The optimal control is given by

\[ u^*(t) \in \operatorname*{argmin}_{u\in[-1,1]}\,[u\,W_x(X(t))] = \begin{cases} -1 & \text{if } X(t) > 0 \\ \phantom{-}1 & \text{if } X(t) < 0 \end{cases} \]

and $u^*(t)$ can take any value in $[-1,1]$ when $X(t) = 0$. This optimal control is obvious of
course; you always push as hard as you can away from the closer boundary. We obtain a
bang-bang control due to the fact that we do not put any cost on the control.
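
The behavior of this bang-bang feedback can be seen in a short simulation. The sketch below is illustrative only: the Euler step size, the choice $u = 0$ at $X = 0$, and the constant disturbances in the usage note are assumptions made for the example and do not come from the report.

```python
def bang_bang_control(x):
    """Push away from the closer boundary of G = (-1, 1)."""
    if x > 0.0:
        return -1.0
    if x < 0.0:
        return 1.0
    return 0.0  # any value in [-1, 1] is admissible at x = 0; 0 is an arbitrary choice


def escape_time(x0, disturbance, dt=1e-3, t_max=50.0):
    """Euler-integrate dX/dt = u + w until |X| >= 1.

    Returns the first exit time, or None if the state stays in G up to t_max.
    `disturbance` is a function of (t, x) returning w.
    """
    x, t = x0, 0.0
    while t < t_max:
        if abs(x) >= 1.0:
            return t
        x += dt * (bang_bang_control(x) + disturbance(t, x))
        t += dt
    return None
```

With a constant disturbance of magnitude 2 pushing toward the boundary, the net speed is 1 and the state started at 0 exits after roughly one time unit. A constant disturbance of magnitude 0.5 is weaker than the control, so the state chatters near 0 and never leaves $G$.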

A slightly more complex example is as follows. Consider a two-dimensional system of
the form

\[ \frac{dX}{dt} = \begin{pmatrix} 1 & 0 \\ 0 & 2 \end{pmatrix} u_t + \begin{pmatrix} 1 \\ 1 \end{pmatrix} + \begin{pmatrix} \sqrt{5} & 0 \\ 0 & 1 \end{pmatrix} w_t \]

which represents non-degenerate linear dynamics plus a constant drift in the (1, 1) direction.
Let the set we wish to keep it in be the rectangular region

\[ G = (0,2) \times (0,1) \]

and let the control take values in the disk

\[ U = \{v \in \Re^2 : |v| \le 4\} \]

so  that  we  are  combining  a  variety  of  geometrical  forms  here.  Let  the  design  parameter 
be  6  =  1.  This  problem  is  still  analytically  solvable,  although  the  algebra  gets  a  bit  more 
tedious.  The  solution  is  piece-wise  planar.  This  solution,  W,  is  depicted  in  figure  2.1.  It  is 
plotted  as  a  function  over  the  region  G  given  above.  (The  scalloping  is  due  to  the  plotting 
software;  the  true  solution  looks  like  four  planes  descending  from  the  four  boundaries  of  the 
set $G$.) In each of the four planar regions, the optimal control is given by

«-(*)=  (J  tyvw(X(t)). 

Also, along the edges where these planar regions meet, the optimal control can be chosen to
be any value in the subdifferential at those points.




Figure  2.1:  Two-dimensional  example 


As  a  last  example,  we  consider  a  degenerate  problem.  In  real-world  systems,  the  state 
often  represents  a  vector  of  objects  such  as  position  and  velocity.  In  such  a  case,  some  of  the 
component equations are exact (e.g. the derivative of position is velocity), and one would
not expect to have disturbances affecting those components of the system. Such systems
are often referred to as degenerate (or not uniformly elliptic), and the theory is often not
fully developed in those cases. For general degenerate problems, the solutions of the Isaacs
equations may not be continuous; however, for degeneracies of this form we have shown that
they are continuous. Consequently, one does not need the tremendously difficult machinery
of  discontinuous  viscosity  solutions.  We  consider  a  simple  such  example.  Let  the  system  be 


given by

\[ \frac{dX}{dt} = V, \qquad \frac{dV}{dt} = u + w \]

where $X$ represents position and $V$ represents velocity. Let the control magnitude be bounded
as

\[ U = \{v \in \Re : |v| \le 4\}. \]

Let the region we are trying to keep the state in be

\[ G = (-1,1) \times (-4,4). \]

This is analogous to a tracking system where one needs to keep the state in a field-of-view
(in this case $X \in (-1,1)$). Further, in such a tracking situation, the state cannot be allowed to
move too quickly across the field-of-view; otherwise the image does not dwell long enough on
individual detectors to register. Consequently we include a bound $V \in (-4,4)$ in our example
region. We again take the design parameter $\theta = 1$ for simplicity. In this case, the Isaacs
equation takes the form

\[ \begin{aligned} 0 &= -\theta + vW_x + \min_{|u|\le 4}\,[uW_v] + \max_{w\in\Re}\left[wW_v - \tfrac{1}{2}|w|^2\right] \\ &= -1 + vW_x - 4|W_v| + \frac{W_v^2}{2}. \end{aligned} \]

We  again  computed  an  analytical  solution,  although  in  this  case  a  computer  algebra  system 
(Maple)  was  used  as  an  aid.  The  solution  is  depicted  in  figure  2.2.  Note  that  the  solution  is 
still  continuous,  although  it  does  have  a  cusp  along  a  curve  across  the  region.  The  optimal 
control  is  given  by  u*(t)  =  ±4  depending  on  which  side  of  this  curve  the  state  lies  on  at  a 

given  moment. 


Figure  2.2:  Second-order  example  appears  on  next  page 




2.1.2  Application  to  Compressor  Stall 


The  compressor  stall  problem  is  one  which  may  be  particularly  suited  to  a  Robust  Escape 
formulation.  The  specific  aspect  of  compressor  stall  that  makes  Robust  Escape  appropriate 
is  the  fact  that  entry  of  the  state  into  rotating  stall  is  catastrophic  in  the  sense  that  it  may 
require shut-down and restart of the system. One could alternatively employ nonlinear $H_\infty$
control  to  stabilize  the  system  around  some  point  different  from  stall.  However,  that  does 
not  capture  the  essence  of  avoiding  the  specific  region  around  the  rotating  stall  equilibrium. 
Consequently,  we  have  chosen  to  work  with  a  Robust  Escape  formulation. 

We  use  a  compressor  model  adapted  from  [MG]  in  the  form 


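\[ \frac{dR}{dt} = \sigma R(1 - \phi^2 - R) \tag{2.9a} \]

\[ \frac{d\phi}{dt} = -\psi + \psi_{c0} + 1 + \tfrac{3}{2}\phi - \tfrac{1}{2}\phi^3 - 3R\phi \tag{2.9b} \]

\[ \frac{d\psi}{dt} = \frac{1}{\beta^2}\left[\phi + 1 - \gamma\sqrt{\psi}\right] \tag{2.9c} \]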


where 


R=l 
4> 

,  v 

* 

where  6  represents  the  circumferentially-averaged  axial  flow  coefficient,  rt-  represents  the 
pressure  rise,  J  represents  the  squared  amplitude  of  the  angular  variation,  and  the  parameter 
H  is  taken  from  [B]  as  H  =  0.32.  Further,  the  time  variable,  t,  is  scaled  from  real-time,  <.  by 
t  _  where  [B]  lc  =  6  and  W  =  0.18.  The  remaining  coefficients  are  given  by 

3  alc 


a  = 


(1  +  ma) 
W 

4  B2H2 


P 

,  i’co 

*«  =  -H 


7  =  7  \Z~H 

where  from  [B],  we  take  a  =  1/3,  m  =  2,  =  0.23,  B  =  0.1  and  7  €  (0, 1).  Consequently,  we 

find $\sigma = 3.6$, $1/\beta^2 = 44$, $\psi_{c0} = 0.72$,
7  =  7v/0l2. 

The corresponding compressor characteristic,

\[ f(\phi) = \psi_{c0} + 1 + \tfrac{3}{2}\phi - \tfrac{1}{2}\phi^3, \]

is depicted in figure 2.3. We consider a fixed value of the throttle setting $\gamma = 0.5$. For this
value of $\gamma$, there is a rotating stall equilibrium of

\[ R = 0.923, \quad \phi = -0.278, \quad \psi = 2.083. \]
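
The quoted equilibrium can be checked directly against the model. The sketch below assumes the right-hand side of (2.9) in the form $dR/dt = \sigma R(1 - \phi^2 - R)$, $d\phi/dt = -\psi + \psi_{c0} + 1 + \frac{3}{2}\phi - \frac{1}{2}\phi^3 - 3R\phi$, $d\psi/dt = \frac{1}{\beta^2}(\phi + 1 - \gamma\sqrt{\psi})$, with the coefficient values $\sigma = 3.6$, $1/\beta^2 = 44$, $\psi_{c0} = 0.72$ stated above. The function and constant names are illustrative, not those of the report's software.

```python
import math

SIGMA = 3.6          # coefficient in the R equation
INV_BETA_SQ = 44.0   # 1 / beta^2
PSI_C0 = 0.72        # shut-off value of the compressor characteristic


def compressor_characteristic(phi):
    """f(phi) = psi_c0 + 1 + (3/2) phi - (1/2) phi^3."""
    return PSI_C0 + 1.0 + 1.5 * phi - 0.5 * phi ** 3


def compressor_rhs(R, phi, psi, gamma):
    """Right-hand side of the three-state model, in the assumed form of (2.9)."""
    dR = SIGMA * R * (1.0 - phi ** 2 - R)
    dphi = -psi + compressor_characteristic(phi) - 3.0 * R * phi
    dpsi = INV_BETA_SQ * (phi + 1.0 - gamma * math.sqrt(psi))
    return dR, dphi, dpsi


# Residual of the model at the quoted rotating stall equilibrium (gamma = 0.5).
residual = compressor_rhs(0.923, -0.278, 2.083, 0.5)
```

All three components of `residual` are small, at the level expected from the three-digit rounding of the quoted equilibrium.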


Figure  2.3:  Compressor  characteristic 

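For a specific example, we allow disturbance to affect all components of (2.9), and allow
a control of the second component (2.9b). In this case, one has a model of the form

\[ \frac{dR}{dt} = \sigma R(1 - \phi^2 - R) + w_1 \tag{2.10a} \]

\[ \frac{d\phi}{dt} = -\psi + \psi_{c0} + 1 + \tfrac{3}{2}\phi - \tfrac{1}{2}\phi^3 - 3\phi R + u + w_2 \tag{2.10b} \]

\[ \frac{d\psi}{dt} = \frac{1}{\beta^2}\left[\phi + 1 - \gamma\sqrt{\psi}\right] + w_3. \tag{2.10c} \]

Letting $X = (X_1,X_2,X_3) = (R,\phi,\psi)$ and $w = (w_1,w_2,w_3)$, this can be rewritten as

\[ \frac{dX}{dt} = F(X) + \begin{pmatrix} 0 \\ u \\ 0 \end{pmatrix} + w \doteq \begin{pmatrix} \sigma X_1(1 - X_2^2 - X_1) \\ -X_3 + \psi_{c0} + 1 + \tfrac{3}{2}X_2 - \tfrac{1}{2}X_2^3 - 3X_1X_2 \\ \frac{1}{\beta^2}\left(X_2 + 1 - \gamma\sqrt{X_3}\right) \end{pmatrix} + \begin{pmatrix} 0 \\ u \\ 0 \end{pmatrix} + w. \tag{2.11} \]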

This particular model has been chosen simply as an example. It is not intended to
suggest that this is in any way the optimal choice of models. Continuing, to be specific, we
allow the control $u(\cdot)$ to take values only in the interval $U = [0,2]$.

Finally, we let the region of interest be given by

\[ G = \{X \in \Re^3 : X_3 > 3\}, \tag{2.12} \]

i.e. the set of states where $\psi > 3$. This region was chosen so as to be as simple as possible and
yet not include the rotating stall equilibrium computed above as being at (0.923, -0.278, 2.083).
For  this  problem,  the  Isaacs  equation  is  (2.7)  with  the  particular  Hamiltonian 


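\[ \begin{aligned} H(x,p) &= \min_{u\in[0,2]}\left\{[F(x) + (0,u,0)^T]^T p\right\} + \max_{w\in\Re^3}\left\{w^T p - \tfrac{1}{2}|w|^2\right\} \\ &= \min_{u\in[0,2]}\left\{[F(x) + (0,u,0)^T]^T p\right\} + \tfrac{1}{2}|p|^2. \end{aligned} \tag{2.13} \]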

In the theory of the previous section, it was indicated that the value of the game was the
viscosity solution of the Isaacs equation in the case that the closure of $G$ was compact. Here we have an
unbounded $G$. It remains to prove that the result remains true in this case. Assuming that
this is in fact true, we see by examining (2.13) that once the solution has been computed,
the optimal control will take the bang-bang form

\[ u^*(t) = \begin{cases} 2 & \text{if } W_{x_2}(X(t)) < 0 \\ 0 & \text{if } W_{x_2}(X(t)) > 0 \end{cases} \tag{2.14} \]

where $W_{x_2}$ is the partial of the value with respect to the second component ($\phi$).

In [McDa], it is shown that under certain conditions, the viscosity supersolution of the
Isaacs equation corresponding to an $H_\infty$ control problem can be obtained by propagating the
characteristic equations over the state-space, and then, for each point in the state-space,
taking the minimum value over all the characteristics which pass through that point. We
expect this algorithm to hold (with minimum replaced by maximum) for the case of Robust
Escape problems as well. Specifically, for this problem the characteristic equations are the
6-dimensional set of ODE's

\[ \begin{aligned} \frac{dX}{dt} &= H_p(X,P) = F(X(t)) + (0,u^*(t),0)^T + P(t) \\ \frac{dP}{dt} &= -H_x(X,P) = -F_x^T(X(t))P(t) \end{aligned} \tag{2.15} \]

where $u^*(t) = \operatorname{argmin}_{u\in U}[uP_2(t)]$ and $F_x$ is the matrix of partials of $F$. One must specify
the initial conditions for (2.15). In particular, one must propagate (2.15) starting from a
sufficiently dense network of initial points on the boundary of $G$ to obtain an approximation
of the value. Thus, one has $X_3(0) = 3$ by (2.12), and then we let $X_1(0) = x_1$ and $X_2(0) = x_2$ range
over various points on the boundary. Since the value is identically zero on the boundary, this
implies that the partials in the plane of the boundary are zero so that $P_1(0) = 0$ and $P_2(0) = 0$.
Finally, given all of the other initial conditions, one can obtain $P_3(0)$ from (2.7), (2.13). In
particular, with the above, (2.7) becomes

\[ 0 = -\theta + \frac{1}{\beta^2}\left(x_2 + 1 - \gamma\sqrt{3}\right)P_3(0) + \tfrac{1}{2}P_3^2(0) \]

which is simply a quadratic equation. This yields

\[ P_3(0) = \frac{1}{\beta^2}\left[-\left(x_2 + 1 - \gamma\sqrt{3}\right) + \sqrt{\left(x_2 + 1 - \gamma\sqrt{3}\right)^2 + 2\beta^4\theta}\,\right] \]



where we have taken the positive root in order to obtain the characteristic trajectories
which enter the set $G$. The value along each characteristic path, which we will denote by
$V(t; X(0), P(0))$, is given by

vm= i\pTd^\dL  t2i6) 
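
The propagation of a single characteristic can be sketched as follows. This is a minimal illustration under stated assumptions, not the report's software: it takes the model (2.11) with $\sigma = 3.6$, $1/\beta^2 = 44$, $\psi_{c0} = 0.72$, $\gamma = 0.5$, $\theta = 1$, uses the minimizer of $uP_2$ over $[0,2]$ as the control in (2.15), approximates $F_x$ by central differences, and integrates with a fixed-step Runge-Kutta scheme. The step size, step count and all function names are hypothetical. Since (2.15) is a Hamiltonian system, $H(X,P)$ should remain equal to $\theta$ along the path, which gives a convenient sanity check.

```python
import math

SIGMA, INV_BETA_SQ, PSI_C0, GAMMA, THETA = 3.6, 44.0, 0.72, 0.5, 1.0


def F(x):
    """Drift of the compressor model (2.11) without control or disturbance."""
    x1, x2, x3 = x
    return [SIGMA * x1 * (1.0 - x2 ** 2 - x1),
            -x3 + PSI_C0 + 1.0 + 1.5 * x2 - 0.5 * x2 ** 3 - 3.0 * x1 * x2,
            INV_BETA_SQ * (x2 + 1.0 - GAMMA * math.sqrt(x3))]


def control(p):
    """Minimizer of u * p2 over U = [0, 2] (ties at p2 = 0 resolved as u = 0)."""
    return 2.0 if p[1] < 0.0 else 0.0


def hamiltonian(x, p):
    """H(x, p) of (2.13)."""
    drift = F(x)
    drift[1] += control(p)
    return sum(d * q for d, q in zip(drift, p)) + 0.5 * sum(q * q for q in p)


def jacobian_transpose_times(x, p, h=1e-6):
    """Approximate F_x(x)^T p by central differences."""
    out = []
    for j in range(3):
        xp, xm = list(x), list(x)
        xp[j] += h
        xm[j] -= h
        dF = [(a - b) / (2.0 * h) for a, b in zip(F(xp), F(xm))]
        out.append(sum(d * q for d, q in zip(dF, p)))
    return out


def rhs(z):
    """Right-hand side of the characteristic system (2.15), z = (X, P)."""
    x, p = z[:3], z[3:]
    dx = F(x)
    dx[1] += control(p)
    dx = [a + b for a, b in zip(dx, p)]
    dp = [-v for v in jacobian_transpose_times(x, p)]
    return dx + dp


def rk4_step(z, dt):
    k1 = rhs(z)
    k2 = rhs([a + 0.5 * dt * b for a, b in zip(z, k1)])
    k3 = rhs([a + 0.5 * dt * b for a, b in zip(z, k2)])
    k4 = rhs([a + dt * b for a, b in zip(z, k3)])
    return [a + dt / 6.0 * (b + 2.0 * c + 2.0 * d + e)
            for a, b, c, d, e in zip(z, k1, k2, k3, k4)]


def initial_costate(x2):
    """Positive root of 0 = -theta + b P3 + P3^2 / 2 on the boundary X3 = 3."""
    b = INV_BETA_SQ * (x2 + 1.0 - GAMMA * math.sqrt(3.0))
    return -b + math.sqrt(b * b + 2.0 * THETA)


def characteristic(x1, x2, dt=1e-4, steps=200):
    """Propagate one characteristic from the boundary point (x1, x2, 3)."""
    z = [x1, x2, 3.0, 0.0, 0.0, initial_costate(x2)]
    path = [z]
    for _ in range(steps):
        z = rk4_step(z, dt)
        path.append(z)
    return path
```

Calling `characteristic(0.5, 0.5)` returns a short path whose third state component increases away from 3, i.e. the path enters $G$. A full computation would launch such paths from a dense set of boundary points and combine their values as described next.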

The value at each point $x \in G$ is obtained by considering all the characteristic paths such
that $X(t) = x$ for some $t > 0$ (and some initial condition $(X(0), P(0))$), and their corresponding
values $V(t; X(0), P(0))$. The maximum such value yields the value function at $x$. (In the actual
software, the region $G$ is divided into cubes, and the maximum value over all trajectories
passing through that cube is assigned to be the value for that cube.)
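
The cube-based bookkeeping described in the parenthetical can be sketched in a few lines. This is an assumed data layout, not the report's implementation: each trajectory is taken to be a list of `(state, value)` samples, the region is an axis-aligned box, and cubes are indexed by integer tuples.

```python
import math


def cell_index(state, lower, cell_size):
    """Integer index of the cube containing `state`."""
    return tuple(int(math.floor((xi - lo) / cell_size))
                 for xi, lo in zip(state, lower))


def aggregate_max(trajectories, lower, upper, cell_size):
    """Assign to each cube the maximum value over all samples falling in it.

    trajectories -- iterable of lists of (state, value) pairs
    lower, upper -- corners of the box over which the value is tabulated
    Returns a dict mapping cube index -> maximum value seen in that cube.
    """
    grid = {}
    for trajectory in trajectories:
        for state, value in trajectory:
            inside = all(lo <= xi < hi for xi, lo, hi in zip(state, lower, upper))
            if not inside:
                continue  # sample lies outside the tabulated region
            key = cell_index(state, lower, cell_size)
            if key not in grid or value > grid[key]:
                grid[key] = value
    return grid
```

Cubes that no trajectory visits are simply absent from the dictionary, which makes it easy to see where the network of initial boundary points would need to be made denser.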

This method has been employed for the above problem. To complete the specification
of the problem, the value of the design parameter $\theta$ was taken to be simply $\theta = 1$. The value
function was computed over a portion of the space by the above method. The subset of $G$
over which the value was computed was $(R,\phi,\psi) \in [0.5,2] \times [-1,2] \times [3,7]$. The computation time
on a SPARCstation 5 was less than a minute. Since the value is defined over a 3-dimensional
region, it is difficult to depict graphically. We display $Q$, that is the value multiplied by -1.
(Recall the robust interpretation (2.8b).) This is displayed over a slice of the region given by
$R = 0.5$, and is depicted in figure 2.4. Recall that $G$ was the region where $\psi > 3$, and so the
value is zero at $\psi = 3$ and positive above that. Recall from (2.14) that the optimal control
is 2 where the partial of $Q = -W$ with respect to $\phi$ is positive and 0 elsewhere. Considering
figure 2.4, one immediately sees that in this case the control would be 2 for most values and
0 in the corner where $\phi$ is large and $\psi$ small. One might also note that in figure 2.4, the
value tends to separate from the boundary value $W(R,\phi,3) = 0$ for larger values of $\phi$. This
is due to the fact that the worst-case trajectories starting out at say (0.5, 0.5, 3.1) would not
escape from a nearby point on the boundary but instead circle upwards and then back down
and out at a place further down on the boundary.




The results depicted in figure 2.4 should not be interpreted as a final (or even serious)
solution to the Robust Escape approach to compressor stall. They have been computed
merely to indicate that initial software for this problem has been generated, and is generating
reasonable results. However, the model considered above is insufficient in several respects.
First of all, the relative sizes of the noise in the various components of the dynamics have not
been addressed at all. To produce more serious results, one would need to communicate with
those modeling the dynamics so as to obtain a reasonable disturbance model. The overall
expected size of the noise is also unknown (as well as the controls), so that it is not possible
to tune the system design parameters to a reasonable set. Lastly, the region $G$ was chosen
to be a half-space for simplicity. In a more reasonable model, one would need to consider
other boundaries as well; certainly one would not want the state to move arbitrarily far in
any direction.

One  can  use  the  controls  obtained  in  the  above  manner  in  a  simulation  to  verify  the 
robustness  of  the  controller.  This  has  been  delayed  for  the  compressor  stall  problem  until 
we  have  better  noise  and  control  models.  To  see  how  this  may  be  done,  refer  to  the  end  of 
the  Aerospace  Plane  Ascent  control  discussion  in  Section  2.2.2. 


2.2  Nonlinear  $H_\infty$  Control  Problems




In this section, we discuss some theoretical/numerical results concerning nonlinear $H_\infty$
control and some remaining open questions. We also indicate how the theory and numerics
can be applied in a particular case, Aerospace Plane Ascent.


2.2.1  General  Theory 

Once again, we suppose the dynamics are of the form (2.1). Let $L(x,u)$ be our cost
criterion. In nonlinear $H_\infty$ control, we choose a disturbance attenuation constant, $\gamma$, and
look for a feedback control $u^*(x)$ such that

\[ \int_0^T L(X(t),u^*(X(t)))\,dt \le \gamma^2\int_0^T |w(t)|^2\,dt + W(x) \tag{2.17} \]

for some function of initial condition $W(\cdot)$ for all $w \in L_2(0,T)$ for all $T < \infty$. Ideally one would
like $\gamma$ to be nearly as small as possible. However, it is difficult to find the smallest feasible
$\gamma$, and further in some cases this leads to controls which are undesirable for other reasons.
Consequently, one most often chooses a value of $\gamma$ for which a solution exists, and searches
for controls corresponding to that attenuation level $\gamma$.

Consider  the  differential  game  with  dynamics  (2.1)  and  payoff  and  value  given  by 


$$
P(x,u,w,T) = \int_0^T L\bigl(X(t),u(t)\bigr) - \gamma^2|w(t)|^2\,dt
$$

$$
W(x) = \inf_{\phi\in\Phi}\,\sup_{w\in\mathcal{W}}\,\sup_{T<\infty} P(x,\phi[w],w,T) \tag{2.18}
$$

where the definitions of Φ and 𝒲 can be found in Section 2.1.1. If one obtains an optimal
feedback control u*(x) for the game, then

$$
\sup_{w\in\mathcal{W}}\,\sup_{T<\infty}\int_0^T L\bigl(X(t),u^*(X(t))\bigr) - \gamma^2|w(t)|^2\,dt = W(x),
$$


which implies that (2.17) is satisfied. More generally, fix a feedback ū(x), and consider

$$
W(x) = \sup_{w\in\mathcal{W}}\,\sup_{T<\infty}\int_0^T L\bigl(X(t),\bar u(X(t))\bigr) - \gamma^2|w(t)|^2\,dt. \tag{2.19}
$$

If this problem has finite value W, then (2.17) is satisfied with attenuation γ and storage W.
The Isaacs equation corresponding to (2.18) is


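$$
\begin{aligned}
0 &= \inf_{u\in U}\,\sup_{w}\Bigl\{[f(x,u)+\sigma(x)w]^T\nabla W + L(x,u) - \gamma^2|w|^2\Bigr\}\\
  &= \inf_{u\in U}\Bigl\{f^T(x,u)\nabla W + L(x,u)\Bigr\} + \frac{1}{4\gamma^2}\nabla W^T\sigma(x)\sigma^T(x)\nabla W\\
  &\doteq H(x,\nabla W).
\end{aligned}
\tag{2.20}
$$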


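In the case where we take a feedback ū(x), we can rewrite the dynamics as

$$
\dot X = \bar f(X(t)) + \sigma(X(t))w(t) \doteq f\bigl(X(t),\bar u(X(t))\bigr) + \sigma(X(t))w(t)
$$

and the cost as L̄(x) = L(x,ū(x)). Then, in this case, (2.20) is replaced by

$$
0 = \bigl\{\bar f^T(x)\nabla W + \bar L(x)\bigr\} + \frac{1}{4\gamma^2}\nabla W^T\sigma(x)\sigma^T(x)\nabla W \doteq H(x,\nabla W). \tag{2.21}
$$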

Note that (2.20), (2.21) do not involve W itself, only its gradient. Consequently, we normalize
the solutions by requiring W(0) = 0 as part of the solution. (Otherwise, if W(·) is a solution,
then so is W(·) + k for any k ∈ ℝ.)

Typically, one considers quadratic cost criteria such as

$$
L(x,u) = |x|^2 + |u|^2. \tag{2.22}
$$

If the nonlinear theory can handle such cost criteria (with Lipschitz dynamics), then it can
subsume the linear-quadratic theory. One would like to compute the control by solving the
Isaacs equation (2.20), or in the case of a trial control ū, compute the solution of (2.21) to
determine if ū is indeed an H∞ controller (which will be true if the solution exists and is
nonnegative [So]).

Unfortunately, under quadratic growth assumptions such as (2.22), there exist multiple
solutions to (2.20) and (2.21). For instance, consider the simple one-dimensional case where
f̄(x) = −x, L̄(x) = x² and σ(x) = 1. Let γ > 1. (For γ < 1, there does not exist a solution,
and one does not have control for this system with such low attenuation.) Then (2.21)
becomes

$$
0 = \frac{1}{4\gamma^2}W_x^2 - xW_x + x^2. \tag{2.23}
$$

This has two classical solutions with W(0) = 0:

$$
W^1(x) = \gamma^2\bigl(1 - \sqrt{1 - 1/\gamma^2}\bigr)x^2 \tag{2.24a}
$$

$$
W^2(x) = \gamma^2\bigl(1 + \sqrt{1 - 1/\gamma^2}\bigr)x^2 \tag{2.24b}
$$

and an infinite number of viscosity solutions. Consequently it is important to be able to
distinguish the correct one in any real computational approach. Soravia [So] indicated a
connection between the value or storage W for (2.19) and the smallest nonnegative viscosity
supersolution. However, this condition (i.e. smallest) is difficult to establish in practice.
In [Mcel], we demonstrate that there exists a unique viscosity solution to (2.21) satisfying
a certain quadratic growth condition (W(x) ≤ k|x|² for a particular k), and that this is the
correct solution for our purposes. In the above example, this solution is the classical solution
given by (2.24a). Under certain conditions, we have also shown this for the Isaacs equation (2.20);
however, a general proof in that case remains an open question. (We should note that some
authors (e.g. [vdS]) have demonstrated for certain nonlinear systems that on some arbitrarily
small region around the origin, the correct solution corresponds to the stable manifold of the
Hamiltonian system; this corresponds to our solution. However, our solution is global, not
just local on some possibly very small neighborhood.)
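
The non-uniqueness in the one-dimensional example can be checked directly. The following is a minimal sketch, not part of the original software: it substitutes W(x) = c x² into (2.23) for the two coefficients appearing in (2.24a) and (2.24b) and evaluates the residual at a few sample points. The function names and the choice γ = 2 are illustrative assumptions.

```python
import math

def quadratic_solutions(gamma):
    """Coefficients c for which W(x) = c * x**2 solves (2.23); requires gamma > 1."""
    root = math.sqrt(1.0 - 1.0 / gamma**2)
    return gamma**2 * (1.0 - root), gamma**2 * (1.0 + root)

def residual(c, x, gamma):
    """Right-hand side of (2.23) evaluated at W(x) = c * x**2, i.e. W_x = 2*c*x."""
    w_x = 2.0 * c * x
    return w_x**2 / (4.0 * gamma**2) - x * w_x + x**2

gamma = 2.0  # illustrative attenuation level, any value above 1 works
c_small, c_large = quadratic_solutions(gamma)
for x in (-3.0, -0.5, 0.0, 1.0, 2.5):
    # Both residuals vanish up to rounding, so both quadratics solve the PDE.
    print(x, residual(c_small, x, gamma), residual(c_large, x, gamma))
```

Both residuals are zero up to rounding error, which is why an additional criterion (here the quadratic growth bound) is needed to select the smaller solution.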




Once one is able to separate correct solutions from artifacts, one is still left with the question
of actual computation. Since (2.20), (2.21) are first-order PDEs, a possible approach is
a generalization of the method of characteristics. Note that there does not exist a boundary
or an initial time from which to begin propagating these characteristics; in this H∞ situation,
one needs to propagate the characteristics outward from the origin backward in time
(i.e. backward along the stable manifold). In [McDa], we demonstrate under rather strong
assumptions that a generalized method of characteristics yields a viscosity supersolution to
the Isaacs equation. Specifically, one takes (2.20) or (2.21) and forms the characteristic
equations

$$
\dot X = -H_p(X,P) \tag{2.25a}
$$

$$
\dot P = H_x(X,P) \tag{2.25b}
$$

to which we append

$$
\dot U = P\cdot\dot X = -P\cdot H_p(X,P). \tag{2.25c}
$$


In an ideal situation, one would have the solution given by

$$
W(x) = U(t)
$$

for the characteristic such that X(t) = x. That is, U(t) would carry the value of the solution
at X(t) along each characteristic. This is what occurs in the classical case. However, for
our problems, one cannot guarantee that there is a unique characteristic whose projection
passes through a given point in the state space. (In fact, merely guaranteeing that any
characteristic has such a projection is non-trivial!) Let us index a collection of solutions of
(2.25) by some abstract index b ∈ B, that is, let (X(·;b), P(·;b), U(·;b)) be a solution to (2.25)
satisfying some initial condition. Let B be some index collection such that for all b ∈ B one
has

$$
H(X(t;b),P(t;b)) = 0, \qquad
\lim_{t\to-\infty} X(t;b) = 0, \quad
\lim_{t\to-\infty} P(t;b) = 0, \quad
\lim_{t\to-\infty} U(t;b) = 0. \tag{2.26}
$$

Under certain conditions, a viscosity supersolution is given by

$$
W(x) = \inf\{U(t;b) : b\in B,\ t\in\mathbb{R}\ \text{s.t.}\ X(t;b) = x\}. \tag{2.27}
$$

(Note that (2.27) is similar to formulae appearing in [Sua] and [Mi] for optimal control
problems.)

One can then apply the controller obtained from this approach in a simulation to demonstrate
that one has, in fact, achieved a true, global, nonlinear H∞ controller. This is discussed
for a particular problem in the next section.
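
To illustrate how the characteristic equations (2.25) carry the value outward from the origin, the following sketch integrates them for the one-dimensional example (2.23), whose Hamiltonian is H(x,p) = p²/(4γ²) − xp + x². It is an illustration under stated assumptions rather than the software used for the report: the characteristic is started a small distance x0 from the origin on the gradient of the solution (2.24a), a fourth-order Runge-Kutta step is used, and γ = 2, x0, and the step count are arbitrary choices.

```python
import math

def propagate_characteristic(gamma=2.0, x0=1e-3, t_end=5.0, steps=5000):
    """Integrate (2.25a)-(2.25c) for H(x,p) = p^2/(4 gamma^2) - x p + x^2."""
    root = math.sqrt(1.0 - 1.0 / gamma**2)
    c1 = gamma**2 * (1.0 - root)            # coefficient of the solution (2.24a)

    def rhs(state):
        x, p, _ = state
        h_p = p / (2.0 * gamma**2) - x      # partial derivative of H in p
        h_x = 2.0 * x - p                   # partial derivative of H in x
        return (-h_p, h_x, -p * h_p)        # (2.25a), (2.25b), (2.25c)

    # Assumed start: slightly off the origin, with P and U taken from (2.24a).
    state = (x0, 2.0 * c1 * x0, c1 * x0**2)
    dt = t_end / steps
    for _ in range(steps):                  # classical fourth-order Runge-Kutta
        k1 = rhs(state)
        k2 = rhs(tuple(s + 0.5 * dt * k for s, k in zip(state, k1)))
        k3 = rhs(tuple(s + 0.5 * dt * k for s, k in zip(state, k2)))
        k4 = rhs(tuple(s + dt * k for s, k in zip(state, k3)))
        state = tuple(s + dt / 6.0 * (a + 2.0 * b + 2.0 * c + d)
                      for s, a, b, c, d in zip(state, k1, k2, k3, k4))
    return state, c1

(x, p, u), c1 = propagate_characteristic()
# U carried along the characteristic should agree with the value c1 * x^2.
print(x, u, c1 * x**2)
```

In this example the state component moves away from the origin as t increases, and the carried value U stays on the smaller quadratic solution, which is the behavior the generalized method relies on.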


2.2.2  Application  to  Aerospace  Plane  Ascent 




The  first  part  of  this  section  (up  to  equation  (2.31))  recaps  the  discussion  appearing  in 
[McMl]. 

Under reasonable assumptions, the translational motion of the Aerospace Plane center-of-mass
in the post-transonic portion of atmospheric ascent is given by

$$
\begin{aligned}
\frac{dE}{dt} &= \frac{V(T-D)}{m}\\
\frac{dm}{dt} &= -\frac{T}{g_E I_{sp}}\\
\frac{dr}{dt} &= V\sin\gamma\\
\frac{d\gamma}{dt} &= \frac{L}{mV} - \Bigl(\frac{\mu}{r^2} - \frac{V^2}{r}\Bigr)\frac{\cos\gamma}{V}
\end{aligned}
$$

where

r = radial distance of the vehicle from Earth center,
V = vehicle speed,
E = V²/2 − μ/r = specific energy,
μ = gravitational constant of the Earth,
γ = vehicle flight path angle,
m = vehicle mass,
T = propulsive force,
D = drag force,
L = lift force,
g_E = gravitational acceleration at sea level,
I_sp = specific impulse of fuel.


The two time-scale approach of Mease et al. ([MVB1], [MK]) will be used to separate
the dynamics. In particular, E and m change only rather slowly, while r and γ change more
rapidly. One may determine a fuel optimal trajectory for the slow variables (E,m). Then,
with respect to the fast variables (r,γ), the values of (E,m) may be assumed to be at some
steady-state values (E_s,m_s) at any given time. The control objective for the fast variables
is then to continually drive the state back to the desired slow manifold while it is being
perturbed by noise. A nonlinear H∞ approach to this problem for the fast dynamics follows.

Let E_s, m_s, r_s, γ_s be the desired steady-state (relative to the fast dynamics) values of a
point on the slow manifold. Let (r,γ) be the fast dynamics values at some time, and let
Δr = r − r_s and Δγ = γ − γ_s. Let V and V_s satisfy


and 




respectively.  Then  the  relevant  dynamics  become  (see  [MVB1]) 


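$$
\begin{aligned}
\frac{d\Delta r}{dt} &= V\sin\gamma - V_s\sin\gamma_s = \cdots\\
\frac{d\Delta\gamma}{dt} &= \cdots\,\cos\gamma_s\,\cdots
\end{aligned}
\tag{2.28}
$$

where

L_s = the desired value of the lift for the slow manifold,
L_f = the lift control for the fast dynamics,
w = the perturbing noise,
σ = the noise coefficient.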


For a first attempt, a payoff for the nonlinear H∞ game (see (2.18)) was chosen to be a
simple quadratic of the form

$$
P((\Delta r,\Delta\gamma),L_f,w,T) = \int_0^T \Delta r^2 + C_1\Delta\gamma^2 + C_2 L_f^2 - \theta^2 w^2\,dt. \tag{2.29}
$$

(The case where additional terms are included in P to account for undesirable regions in the
state space has also been examined. However, for the sake of brevity, such results are not
included here.) To further simplify the first attempt, it has been assumed that the control
may be unbounded. Note that if one does not place a bound on the size of the noise, then
one must allow possibly unbounded controls as well in order for the H∞ problem to have a
solution. The resulting Hamiltonian for (2.20) then becomes


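$$
\begin{aligned}
H(\Delta r,\Delta\gamma,W_1,W_2)
 &= \min_{L_f}\max_{w}\Bigl\{W_1 f^{(1)} + W_2\Bigl(f^{(2)} + \frac{L_f}{m_s V} + \sigma w\Bigr)
    + \Delta r^2 + C_1\Delta\gamma^2 + C_2 L_f^2 - \theta^2 w^2\Bigr\} && (2.30)\\
 &= W_1 f^{(1)} + W_2 f^{(2)} + \frac{1}{4}\Bigl[\frac{\sigma^2}{\theta^2} - \frac{1}{C_2 m_s^2 V^2}\Bigr]W_2^2
    + \Delta r^2 + C_1\Delta\gamma^2. && (2.31)
\end{aligned}
$$

In order for the problem to have a feasible solution, we must choose the coefficients such
that σ²/θ² − 1/(C_2 m_s² V²) < 0.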
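
The passage from the min-max form of the Hamiltonian to its closed form is a pair of scalar completions of the square. The following worked step is a sketch under the assumptions that the lift control enters the Δγ equation as L_f/(m_s V), that the noise enters as σw, that θ denotes the attenuation constant of the payoff (2.29), and that C_2 > 0; W_2 denotes the Δγ component of the gradient.

```latex
\begin{aligned}
\min_{L_f}\Bigl\{\frac{W_2 L_f}{m_s V} + C_2 L_f^2\Bigr\}
  &= -\frac{W_2^2}{4 C_2 m_s^2 V^2}
  &&\text{attained at } L_f = -\frac{W_2}{2 C_2 m_s V},\\
\max_{w}\bigl\{\sigma W_2 w - \theta^2 w^2\bigr\}
  &= \frac{\sigma^2 W_2^2}{4\theta^2}
  &&\text{attained at } w = \frac{\sigma W_2}{2\theta^2}.
\end{aligned}
```

Adding the two terms gives a coefficient of (1/4)[σ²/θ² − 1/(C_2 m_s² V²)] on W_2², and the minimizing L_f is the feedback that is applied once the gradient component W_2 is known.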

The generalized method of characteristics described in the previous section is used to
generate the solution. Then, one obtains the H∞ feedback controller from

$$
L_f(\Delta r,\Delta\gamma) = -\frac{W_2(\Delta r,\Delta\gamma)}{2C_2 m_s V(\Delta r)}. \tag{2.32}
$$


The possible real-time algorithm is as follows (refer to figure 2.6). Given a particular value
of (Δr,Δγ), one may search for the characteristic whose state component propagates outward
from the origin to (Δr,Δγ). One may store some characteristics such as those depicted in the
figure. Then for any (Δr,Δγ), one has bounds on the correct values of (W_1,W_2) which cause
a characteristic propagating backwards from (Δr,Δγ,W_1,W_2) to return to the origin. The
bounds are obtained by examining the characteristics on either side of (Δr,Δγ). (That is,
one has bounds on, say, W_2, and then one obtains W_1 from Δr, Δγ, W_2 and the PDE.) Then
one applies a shooting technique to find the correct starting value for W_2 and, consequently,
W_1. This is analogous to a standard approach for finding the initial values of the costate
variables in optimal control theory.


Figure  2.6:  Projection  of  characteristics 

This  approach  was  embedded  in  a  simulation,  and  the  disturbance  attenuation  property 
was  verified. 


3.  Nonlinear  Robust  Filtering 

In  this  section,  we  discuss  some  recent  advances  we  have  made  in  nonlinear  Robust 
Filtering.  We  describe  the  theory  and  some  numerical  methods.  As  additional  support  for 
this  viewpoint  on  filtering,  we  also  present  a  short  discussion  of  the  Robust  Filter  as  the 
limit  of  the  stochastic  Risk-Sensitive  Filter. 




3.1  Motivation  and  Theoretical  Development 

Robust Filtering is the natural extension of Robust/H∞ control methods to the filtering
and estimation situation. The approach can be formulated as a deterministic game where
the disturbance is chosen by an antagonistic player trying to degrade our filter performance.

The fact that the disturbances are chosen by an antagonistic player (trying to degrade our
filter estimate quality) rather than being random leads naturally to a worst-case based strategy.
This is especially appropriate for systems where one desires to track a non-cooperative
target [BWGH]. In such a case the state would not be expected to follow a nominal path
with diffusion-type disturbances around that trajectory.

The robust/H∞ approach uses a dynamical model of the form

$$
\frac{dX}{dt} = f(X) + \sigma(X)w, \qquad X(0) = x_0 \tag{3.1}
$$

(where we are not assuming x_0 known a priori). X is the state, f represents the nominal
dynamics, w is a deterministic, but (a priori) unknown L2 process, and σ is some multiplier
on the disturbance. This model is in contrast to the diffusion model

$$
dX_t = f(X_t)\,dt + \sigma(X_t)\,dB_t
$$

where B is a Brownian motion. The fact that the disturbance in the robust model is an L2
process  rather  than  a  Brownian  motion  has  important  numerical  ramifications.  Specifically, 
in  the  case  of  a  stochastic  model,  the  nonlinear  filter  takes  the  form  of  a  second-order  PDE 
often  referred  to  as  a  Zakai  or  Kushner  equation.  On  the  other  hand,  the  Robust  Filter 
will  lead  to  a  first-order  PDE  similar  to  those  discussed  in  the  previous  section.  One  may 
take  advantage  of  the  first-order  nature  in  ways  which  are  not  possible  for  second-order 
equations.  In  particular,  the  information  is  carried  along  characteristics  which  propagate 
at  a  finite  rate,  thus  allowing  one  to  use  numerical  approaches  such  as  the  one  described 
below.  In  contrast,  for  a  second-order  equation,  a  small  change  in  the  initial  conditions  at 
one  point  propagates  at  infinite  speed  throughout  the  system.  Thus  in  that  case,  one  is 
required  to  solve  the  PDE  on  the  whole  space  (or  some  approximating  large  set)  rather  than 
just  on  some  narrow  manifold  containing  the  optimal  estimate.  This  difference  is  critical  to 
the  question  of  feasible  real-time  estimation  and  tracking. 

We  use  an  analogous  measurement  model  of  the  form 

$$
z(t) = g(X(t)) + \rho(X(t))v(t) \tag{3.2}
$$

where z is the measurement process taking values in ℝ^l, g is the nominal measurement model,
v ∈ L2 is a finite-energy noise signal taking values in ℝ^k (where k ≥ l), and ρ is some multiplier
on the noise. One can view w and v together as antagonistic to our goal, and attempt to
attenuate their effect. This will be called Robust (or H∞) Filtering.


We also note that some of this work harks back to Mortensen [Mo] and Hijab [H], but
is now set in the framework of nonlinear H∞ control and viscosity solutions. In particular,
the information state is quite close. However, this information state is interpreted in an
H∞ framework which then leads to a different estimate. This new estimate is, in fact, the
estimate which corresponds to the risk-sensitive limit of the stochastic model. This work is
also closely related to the study of H∞ control under partial observations [JBE]. However, in
this case, the chosen criterion leads to a finite-dimensional dynamic programming equation
as opposed to the infinite-dimensional PDE of [JB].

In analogy with H∞ control, the Robust Filter will provide an estimate meeting a disturbance
attenuation bound of the form

$$
|x_T - e_T|^2 \le \gamma^2\bigl[p_0(x_0) + \|w\|^2 + \|v\|^2\bigr]
$$

where x_T is the true state at time T and e_T is our robust estimate at time T. This is a bound
on the estimate error in terms of the L2-norm of the disturbances.

We assume that f, σ, g and ρ are all continuously differentiable. We assume that f, g and
σ are globally Lipschitz in x, that σ is bounded, and that a = σσ^T is uniformly non-degenerate.
We also assume that Range(ρ(x)) = ℝ^l for all x ∈ ℝ^n, which guarantees that for any z, x there
exists some v satisfying (3.2), which should certainly be the case if our measurement model
is properly constructed. Finally, we define ρ^{-1} by

$$
\rho^{-1}(x)b = \operatorname{argmin}\{|v| : \rho(x)v = b\}. \tag{3.3}
$$

Assume that ρ^{-1} is uniformly bounded, that is, there exists C_ρ < ∞ such that |ρ^{-1}(x)b| ≤ C_ρ|b|
for all x ∈ ℝ^n and b ∈ ℝ^l. Note also that these assumptions imply that if we view the integral
version of (3.1)

$$
x_T = X(T) = x_0 + \int_0^T f(X(t)) + \sigma(X(t))w(t)\,dt \tag{3.4}
$$

as a mapping from x_0 to x_T then this mapping is one-to-one and onto for any w ∈ L2, thereby
ensuring that we may invert it.

Let p_0(x_0) be a measure of our uncertainty about the initial state x_0, and let it have at
most quadratic growth, so that 0 ≤ p_0(x) ≤ C(1 + |x|²) for all x ∈ ℝ^n for some C < ∞. Suppose
we wish to estimate the state at time T. Consider a cost criterion of the form

$$
\begin{aligned}
P(T,x_T,w(\cdot)) &= -p_0(x_0) - \int_0^T |w(t)|^2 + |v(t)|^2\,dt && (3.5)\\
 &= -p_0(x_0) - \int_0^T |w(t)|^2 + \bigl|\rho^{-1}(X(t))[z(t) - g(X(t))]\bigr|^2\,dt && (3.6)
\end{aligned}
$$

where x_0 is given by (3.4) for any particular w. Note that v no longer appears in (3.6) since it
is fixed for a given choice of disturbance w, measurement time-history z and specific terminal
state x_T. Similarly, x_0 is fixed given x_T and w as well. Therefore we define the value function
by

$$
W(T,x_T) = \sup_{w\in L_2} P(T,x_T,w); \tag{3.7}
$$



this  is  also  known  as  the  information  state  (see,  for  instance,  [JB],  [JBE]). 
The  Hamilton-Jacobi  equation  corresponding  to  (3.1),  (3.6),  (3.7)  is 


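$$
\begin{aligned}
0 &= -W_t + H(t,x,\nabla_x W), && t > 0\\
W(0,x) &= -p_0(x), && t = 0
\end{aligned}
\tag{3.8}
$$

where ∇_x represents the gradient with respect to the space variable and

$$
\begin{aligned}
H(t,x,q) &= \sup_{w}\Bigl\{-[f(x)+\sigma(x)w]^T q - |w|^2 - \bigl|\rho^{-1}(x)(z(t)-g(x))\bigr|^2\Bigr\} && (3.9a)\\
 &= -f^T(x)q + \tfrac{1}{4}q^T a(x)q - \bigl|\rho^{-1}(x)(z(t)-g(x))\bigr|^2. && (3.9b)
\end{aligned}
$$
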

It  can  be  shown  that  W  is  a  continuous  viscosity  solution  of  (3.8)  [Mcrf]. 

We will now indicate how the solution to (3.8) may be used to obtain a robust state
estimate. First we will indicate a direct approach, and then we will combine the above with
a quadratic estimation criterion to obtain the Robust Filter. For the direct approach, note
that −W(T,x) represents, in some sense, the minimum disturbance energy needed for the
target state at time T to be x for a given measurement path z(·). The Mortensen estimate is

$$
e_T = \operatorname*{argmax}_{x\in\mathbb{R}^n} W(T,x). \tag{3.10}
$$

(See [H], [Mo] for earlier work in this vein.) We can assert the existence of the argmax in
(3.10) under the quite reasonable assumption that there exists C_p > 0 such that p_0(x) ≥ C_p|x|²
for all x [Mcrf].

Although the direct estimate is of some interest, the following robust estimate will have
desirable properties (analogous to H∞ control) as discussed below, and will be the limit of
a risk-sensitive filter. Specifically, the information state can be combined with a quadratic
estimate error criterion to obtain the Robust Filter. Let the dynamics and measurement
models be those given above in (3.1) and (3.2), and make the same assumptions. However,
now let the cost criterion take the form

$$
P(T,e,x_T,w) = -\gamma^2\Bigl[p_0(x_0) + \int_0^T |w(t)|^2 + |v(t)|^2\,dt\Bigr] + |x_T - e|^2
$$

where we again note that x_0 and v(·) are given by x_T and w(·). Let

$$
W(T,e) = \sup_{x_0\in\mathbb{R}^n}\,\sup_{w\in\mathcal{W}} P(T,e,x_T,w)
       = \sup_{x_T\in\mathbb{R}^n}\,\sup_{w\in\mathcal{W}} P(T,e,x_T,w). \tag{3.11}
$$

Note that

$$
W(T,e) = \sup_{x_T\in\mathbb{R}^n}\bigl[\gamma^2 W(T,x_T) + |x_T - e|^2\bigr], \tag{3.12}
$$

where W(T,x_T) on the right-hand side is the information state (3.7).
It is clear here that γ must be large enough so that this supremum will be finite and achieved
at some x_T. Given the above assumptions we have this; that is, there exists γ_0 such that the
supremum in (3.12) is finite.



This lower bound on γ is directly analogous to the optimal disturbance attenuation
parameter in H∞ control below which the supremum in that problem becomes unbounded.
Note that in a computational system, given a W, one can choose a γ corresponding to this
W such that the supremum is finite.

Now note that since W(T,e) is a supremum of functions which are convex in e, it is also
convex in e. Further, W(T,e) → ∞ as |e| → ∞. Consequently, the minimum over e is obtained
at some point, and we define the Robust Filter estimate at time T as:

$$
e_T = \operatorname*{argmin}_{e\in\mathbb{R}^n} W(T,e). \tag{3.13}
$$

Further, one may choose γ large enough so that min_{e∈ℝ^n} W(T,e) ≤ 0, in which case one obtains
the error estimate

$$
|x_T - e_T|^2 \le \gamma^2\bigl[p_0(x_0) + \|w\|^2 + \|v\|^2\bigr] \tag{3.14}
$$

for all x_0 ∈ ℝ^n and all v, w ∈ L2, where ‖·‖ represents the L2-norm over [0,T]. This is the robust
bound on the estimate error in terms of the energy of the disturbance, and is analogous to
H∞ disturbance attenuation bounds.
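
Once the information state is available on a grid, the estimate defined by (3.12) and (3.13) reduces to a nested maximization and minimization. The sketch below shows this for a one-dimensional state. It is an illustration only: the information state −(x − 1)², the grids, the value γ = 2, and the function names are all hypothetical choices, not quantities from the report.

```python
def robust_estimate(info_state, xs, es, gamma):
    """Grid version of (3.12)-(3.13).

    For each candidate estimate e, take the maximum over grid states x of
    gamma^2 * W(T, x) + (x - e)^2, then return the e with the smallest maximum.
    """
    best_e, best_val = None, float("inf")
    for e in es:
        val = max(gamma**2 * info_state(x) + (x - e) ** 2 for x in xs)
        if val < best_val:
            best_e, best_val = e, val
    return best_e, best_val

def info_state(x):
    # Hypothetical information state W(T, x): concave with its peak at x = 1.
    return -(x - 1.0) ** 2

xs = [i / 100 for i in range(-500, 701)]   # state grid on [-5, 7]
es = [i / 100 for i in range(-100, 301)]   # candidate estimates on [-1, 3]
e_hat, value = robust_estimate(info_state, xs, es, gamma=2.0)
print(e_hat, value)
```

For this symmetric information state the robust estimate coincides with the maximizer of W(T,·); the two differ in general when W(T,·) is not symmetric about its peak. Note that γ must exceed 1 here for the inner maximum to stay finite as the state grid is enlarged.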

Note that the information state computations are recursive. That is, having obtained
an estimate at time T, one can obtain an estimate at time T̂ > T by extending the solution
of (3.8) from T to T̂.

Finally, we note that this approach also bears some similarities to the recent work of
Krener and Duarte [KD].


3.2  Numerical  Methods 

In the previous subsection, we introduced the Robust Filter. We also indicated that it
had the desirable property of disturbance attenuation (see (3.14)) analogous to H∞ control.
We  also  mentioned  that  a  secondary  advantage  of  Robust  Filtering  for  nonlinear  systems 
is  that  the  corresponding  PDE  is  first-order  (as  opposed  to  the  second-order  nature  of  the 
Zakai  equation).  This  is  an  advantage  in  that  one  may  use  numerical  methods  which  are 
computationally  much  quicker  than  those  available  for  second-order  PDEs.  In  particular, 
one  can  use  the  generalized  method  of  characteristics  described  in  Section  2  as  well  as  grid 
methods  based  on  the  characteristic  flow.  We  will  describe  this  briefly  below.  We  will  also 
indicate  the  Robust  version  of  the  operator  splitting  method  which  Rozovskii  [LRR]  employs 
in  the  stochastic  filter  case. 


3.2.1  Generalized  Method  of  Characteristics 

The generalized method of characteristics described in Section 2 works here in more-or-less
the same way. However, in this case, there is time-dependency in the Hamiltonian (3.9)
via the measurement process z(t). Thus there is another component of the characteristic
system of ODEs. We denote this component by Q; it corresponds to W_t.

$$
\begin{aligned}
\dot X &= -H_p(t,X,P) && (3.15a)\\
\dot P &= H_x(t,X,P) && (3.15b)\\
\dot Q &= H_t(t,X,P) && (3.15c)\\
\dot U &= Q - P^T H_p(t,X,P) && (3.15d)
\end{aligned}
$$


The characteristics are propagated forward in time from the initial time to the time at which
we are attempting to estimate the state (the current time). Characteristics are propagated
forward from various initial state points X(0) = x_0. The initial conditions of the other
components are given by

$$
\begin{aligned}
U(0) &= -p_0(x_0)\\
P(0) &= -\nabla p_0(x_0)\\
Q(0) &= H(0,x_0,P(0)).
\end{aligned}
$$

Index the characteristics by the initial state component (i.e. X(t;x_0), P(t;x_0), etcetera). Then
the value is given by

$$
\widetilde W(t,x) = \max\{U(t;x_0) : X(t;x_0) = x\}.
$$

In [Mcrf] it is shown that this does indeed yield the information state, that is W(t,x) = W̃(t,x)
for all t > 0 and x ∈ ℝ^n.

Note that for the Robust Filter, it is not necessary to solve the PDE over the whole
space (as in stochastic nonlinear filtering), but only in some small region around the optimal
estimate. This can be used to greatly reduce the required computation. We roughly outline
the concept. Suppose one desires to estimate the state at times t > 0. Suppose one propagates
a characteristic forward to some time t_1 > 0, yielding U(t_1;x_0). Then, turn to another
characteristic starting from x_1 at t = 0. If one is propagating this new characteristic, and the
value falls significantly below U(t_1;x_0) at time t_2 < t_1 (i.e. U(t_2;x_1) ≪ U(t_1;x_0)), then one can
stop propagating the characteristic for an estimate at time t_1: the integrand in (3.6) enters with
a minus sign and is nonnegative, so U cannot increase along a characteristic, and hence this
characteristic cannot be a point where the optimal estimate occurs. Taking this concept further,
one obtains a classic branch-and-bound algorithm. The ramification is that one only propagates
the characteristics forward over a very small region of the state space.
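
The pruning idea can be sketched in a few lines. The code below is an illustration under several assumptions that are not taken from the report: a continuous-time measurement with the hypothetical model g(x) = x and a hypothetical constant signal z(t) = 1, explicit Euler time stepping, and the initial information p_0(x) = x². The dynamics f(x) = 4 sin(x) and the multipliers σ = 1, ρ = 5 follow the example of this section. Since Q equals H along a characteristic, the update for U uses H − P·H_p in place of carrying Q separately.

```python
import math

SIGMA, RHO = 1.0, 5.0      # noise multipliers from the example in the text
Z = 1.0                    # hypothetical constant measurement signal z(t)

def f(x):  return 4.0 * math.sin(x)
def df(x): return 4.0 * math.cos(x)
def g(x):  return x        # hypothetical measurement model, so g'(x) = 1

def step(x, p, u, dt):
    """One explicit Euler step of the characteristic system (3.15)."""
    m = (Z - g(x)) / RHO                              # rho^{-1} (z - g(x))
    h = -f(x) * p + 0.25 * SIGMA**2 * p * p - m * m   # Hamiltonian (3.9b)
    h_p = -f(x) + 0.5 * SIGMA**2 * p
    h_x = -df(x) * p + 2.0 * m / RHO
    # Q = W_t equals H along a characteristic, so dU/dt = H - P * H_p.
    return x - dt * h_p, p + dt * h_x, u + dt * (h - p * h_p)

def propagate(x0, t_end, dt, floor=None):
    """Return (x, u) at t_end, or None as soon as u drops below `floor`."""
    x, p, u = x0, -2.0 * x0, -x0 * x0   # P(0) = -p0'(x0), U(0) = -p0(x0)
    if floor is not None and u < floor:
        return None
    for _ in range(round(t_end / dt)):
        x, p, u = step(x, p, u, dt)
        if floor is not None and u < floor:
            return None                 # pruned: U can only decrease further
    return x, u

t1, dt = 1.0, 1e-3
_, u_ref = propagate(0.0, t1, dt)       # reference characteristic from x0 = 0
starts = [0.05 * i for i in range(-60, 61)]
results = {x0: propagate(x0, t1, dt, floor=u_ref) for x0 in starts}
survivors = {x0: r for x0, r in results.items() if r is not None}
print(len(starts), len(survivors), u_ref)
```

Only characteristics whose value stays at or above the reference value reach the final time; the rest are abandoned early, many of them at the initial time. A fuller branch-and-bound scheme would also raise the threshold whenever a better characteristic is found.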

Now,  we  indicate  a  computation  of  the  information  state  by  the  characteristic  method  for 
a  simple,  one-dimensional  state  example.  (We  will  not  use  the  branch-and-bound  approach, 
but  simply  propagate  all  the  characteristics  forward  to  the  time  where  we  choose  to  estimate 
the  state.  This  will  produce  a  clearer  graphical  image.)  Once  one  computes  the  information 
state to the desired accuracy, the computation of the Robust Filter estimate (3.13) is quite
straightforward. The purpose of this example is simply to indicate that this method can
provide  an  efficient  means  for  obtaining  Robust  Filter  estimates  for  a  highly  nonlinear  system. 




Let  the  dynamics  and  (discrete-time)  measurement  models  be  given  by 

$$
\dot X = 4\sin(X) + \sigma w
$$

zi  =  ~2~^  +  ^  +  PVi 

with a measurement sample period of 0.025 units. The solution is propagated forward 1
time unit (i.e. 40 measurement updates). The noise multipliers were taken to be σ = 1 and
ρ = 5. The (true) initial state is X(0) = 0. The initial information is described by p_0(x) = x²
which corresponds to a correct initial state estimate. Note that the dynamics are unstable
in a neighborhood of the origin. The disturbances were chosen to be

w{t)  =  i  +  sin  (-y-*)  and  vi  =  yj sin  (^i) 

so that these disturbances represent a slowly varying, biased noise signal. (Note that the Robust
Filter does not require a statistical interpretation of these noise signals.) The resulting
information state appears in figure 3.1.


Figure  3.1:  A  simple  information  state  computation 


3.2.3  Splitting  Method 

A rather interesting approach which might be computationally competitive with a characteristic
based method is an extension of the approach of Rozovskii (see for instance [LRR])
to a robust framework. In the Robust Filter, this approach leads to a very surprising and
quite interesting technique. For this technique, we will suppose that the dynamics remain
as a continuous-time model (3.1), but that the measurements occur at discrete times

$$
z_i = g(X(t_i)) + \rho(X(t_i))v_i. \tag{3.16}
$$

Suppose that we wish to estimate the state at time T, and that there have been N measurements
in (0,T]. Then we may adapt the cost criterion (3.6) and value function (3.7) as
follows. Let our cost criterion be

$$
P(T,x_T,w(\cdot)) = -p_0(x_0) - \int_0^T |w(t)|^2\,dt - \sum_{i=1}^{N}\bigl|\rho^{-1}(X(t_i))[z_i - g(X(t_i))]\bigr|^2 \tag{3.17}
$$

and let the value be given by

$$
W(T,x_T) = \sup_{w\in L_2} P(T,x_T,w). \tag{3.18}
$$

Due to the discrete nature of the measurements, it is helpful to recall the form of the
dynamic programming principle for such a system. Let W(t_j^+,x) be the value at time t_j just
after measurement j, and W(t_j^-,x) be the value at time t_j just before the measurement. If T
is not a measurement time, and t ∈ (t_{k-1},t_k), we have

$$
W(T,x) = \sup_{w}\Bigl\{W(t,X(t)) - \int_t^T |w(r)|^2\,dr - \sum_{j=k}^{N}\bigl|\rho^{-1}(X(t_j))[z_j - g(X(t_j))]\bigr|^2\Bigr\}
$$

and when T occurs at the time of measurement j, we have

$$
W(T^+,x) = W(T^-,x) - \bigl|\rho^{-1}(x)[z_j - g(x)]\bigr|^2. \tag{3.19}
$$

Let S_Δt be the solution operator which propagates the solution forward from the time
t_i to the time t_{i+1}, that is W(t_{i+1}^-,x) = S_Δt[W(t_i^+,·)](x). Then we can show that this solution
operator for the Robust Filter has the interesting property that

$$
S_{\Delta t}\Bigl[\max_{k\le K}\{\phi_k(x) + c_k\}\Bigr] = \max_{k\le K}\bigl\{S_{\Delta t}[\phi_k(x)] + c_k\bigr\} \tag{3.20}
$$

for any choice of constants c_k. This is analogous to a linear operator, L, in which case one
has

$$
L\Bigl[\sum_{k\le K} c_k\phi_k(x)\Bigr] = \sum_{k\le K} c_k L[\phi_k(x)]. \tag{3.21}
$$

More specifically, the addition of a constant in (3.20) is analogous to multiplication by a
constant in (3.21), and maximization in (3.20) is analogous to summation in (3.21). The
operations in (3.20) pass through the operator S_Δt in the same way that the operations in
(3.21) pass through the linear operator L. Thus, S_Δt provides a very interesting analogy to
linear operators which is particularly suited to robust problems. And, in particular, (3.20)
will provide a computationally effective method for solving the Robust Filter problem.
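
The property (3.20) can be observed numerically on a grid. The sketch below is an illustration under a simplifying assumption that is not from the report: the dynamics are taken as Ẋ = w (f = 0, σ = 1), for which the least disturbance energy needed to move from y to x in time Δt is (x − y)²/Δt, so the solution operator between measurements becomes a maximization over the starting point y. The basis functions, constants, and grid are hypothetical.

```python
def solution_operator(phi_vals, xs, dt):
    """Grid form of S[phi](x) = max over y of { phi(y) - (x - y)^2 / dt }.

    Assumes the simplified dynamics dX/dt = w, so that (x - y)^2 / dt is the
    minimal energy of a disturbance steering the state from y to x in time dt.
    """
    return [max(pv - (x - y) ** 2 / dt for pv, y in zip(phi_vals, xs)) for x in xs]

xs = [i / 20 for i in range(-40, 41)]                 # grid on [-2, 2]
centers = [-1.0, 0.0, 1.0]                            # hypothetical basis centres
consts = [0.3, -0.2, 0.1]                             # hypothetical constants c_k
basis = [[-2.0 * (x - c) ** 2 for x in xs] for c in centers]

dt = 0.1
# Left side of (3.20): apply the operator to the max-plus combination.
combined = [max(b[i] + ck for b, ck in zip(basis, consts)) for i in range(len(xs))]
lhs = solution_operator(combined, xs, dt)
# Right side of (3.20): apply the operator to each basis function, then combine.
pieces = [solution_operator(b, xs, dt) for b in basis]
rhs = [max(p[i] + ck for p, ck in zip(pieces, consts)) for i in range(len(xs))]

max_gap = max(abs(a - b) for a, b in zip(lhs, rhs))
print(max_gap)
```

The two sides agree up to rounding error, because maximization commutes with maximization and an added constant passes through the inner maximum unchanged.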




We approximate a function, W(x), in a way analogous to the standard basis function
approach. Let φ_k(x) be a set of concave-down functions which have higher concavity than
W such that we can approximate the function W by max_{k≤K}{φ_k(x) + c_k}, so that

$$
\lim_{K\to\infty}\Bigl|W(x) - \max_{k\le K}\{\phi_k(x) + c_k\}\Bigr| = 0
$$

for all x in some region. For simplicity, we do not include a display of such functions φ_k, but
simply note that for the problem at hand, a series of concave-down, quadratic functions is
sufficient. We also note that to obtain the c_k, one may take c_k = min_x[W(x) − φ_k(x)]. In this
way, one has W(x) ≥ φ_k(x) + c_k for all x and there exists x_0 such that W(x_0) = φ_k(x_0) + c_k.

Suppose we approximate W(t_i^+,·) by max_{k≤K}{φ_k(·) + c_k}. Then we have the approximation
at t = t_{i+1}^- given by

$$
W(t_{i+1}^-,x) \approx S_{\Delta t}\Bigl[\max_{k\le K}\{\phi_k(\cdot) + c_k\}\Bigr] = \max_{k\le K}\bigl\{S_{\Delta t}[\phi_k(\cdot)] + c_k\bigr\}. \tag{3.22}
$$

The point is that one may pre-compute the action of the solution operator on the basis
functions φ_k up to some order K < ∞, that is, the S_Δt[φ_k] are precomputed and stored for all
the basis functions φ_k, k ≤ K. We then obtain an approximation to W(t_{i+1}^+,·) by (see (3.19))

$$
W(t_{i+1}^+,x) \approx \max_{k\le K}\bigl\{S_{\Delta t}[\phi_k(\cdot)] + c_k\bigr\} - \bigl|\rho^{-1}(x)[z_{i+1} - g(x)]\bigr|^2. \tag{3.23}
$$

One then expands this approximation of W(t_{i+1}^+,x) in basis functions, and repeats the process
(3.23) to yield the approximation at the next time step and so on.

This method has the same advantages that the method of Rozovskii has for the stochastic
problem. Of course, our PDE is simpler to begin with because it is first-order, so it is
not clear at this time whether this method is computationally competitive with characteristic
methods.


3.3  Robust  Filter  as  a  Risk-Sensitive  Limit 

At the top of this section (Section 3), we indicated two arguments in support of the
Robust Filter. The first is that, if one is tracking a target which is deliberately evading
the tracking, one would not expect diffusion-type behavior of the target state. A more
appropriate model is an $L_2$ disturbance which is chosen antagonistically to our tracking
objective. The second is that the nonlinear robust problem leads to a nonlinear first-order PDE
which (we maintain) is easier to solve than the second-order Zakai equation.

As  further  support  of  the  Robust  Filter  (and  in  particular,  the  above  form  as  opposed  to 
other  forms  suggested  in  [BJP],  [MED]),  we  briefly  present  an  outline  of  the  demonstration 
that  this  filter  is  the  limit  of  the  Risk-Sensitive  Stochastic  Filter.  The  following  discussion 
represents  joint  work  with  W.  H.  Fleming.  It  is  extracted  from  the  more  complete  version 
appearing  in  [FMcf]. 




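In place of (3.1), (3.2), one employs the stochastic model

$$dX_t = f(X_t)\,dt + \sqrt{\varepsilon}\,\sigma(X_t)\,dB_t \tag{3.24}$$

$$dZ_t = g(X_t)\,dt + \sqrt{\varepsilon}\,\rho\,d\tilde{B}_t, \qquad Z_0 = 0 \tag{3.25}$$

where $B_\cdot$, $\tilde{B}_\cdot$ are independent Brownian motion processes. The initial state $X_0$ is independent
of $B_\cdot$, $\tilde{B}_\cdot$, and has density $k_\varepsilon \exp(-\varepsilon^{-1}p_0(x))$ for some constant $k_\varepsilon$. The parameter $\varepsilon$ will be
important in the risk-sensitive limit.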

Let $q^\varepsilon$ denote the unnormalized conditional density, which is a solution to the Zakai
stochastic PDE with initial data $q^\varepsilon(x,0) = \exp(-\varepsilon^{-1}p_0(x))$. Let $V^\varepsilon = -\varepsilon \log q^\varepsilon$.

For fixed $T$, let

$$\Psi^\varepsilon(e) = \int \exp\Big[\frac{\mu}{\varepsilon}|x - e|^2\Big]\, q^\varepsilon(x,T)\,dx, \tag{3.26}$$

where $\mu$ is another parameter similar to $\gamma$. The risk sensitive filtering problem is to find an
estimate $\hat{e}$ which minimizes $\Psi^\varepsilon(e)$. (Note that if one deleted the exponentiation from (3.26),
then one would have the standard stochastic filter.)

Minimizing $\Psi^\varepsilon(e)$ is equivalent to finding a $\mathcal{Z}_T$-measurable estimator $e_T$ which minimizes
the criterion $E\exp\big[\frac{\mu}{\varepsilon}|X_T - e_T|^2\big]$, where $\mathcal{Z}_T$ is the $\sigma$-algebra generated by the accumulated
observations $Z_t$ for $0 \le t \le T$. Other authors consider instead the problem of finding $\mathcal{Z}_t$-measurable
estimators $e_t$ for $0 \le t \le T$ such that $E\exp\big[\frac{\mu}{\varepsilon}\int_0^T |X_t - e_t|^2\,dt\big]$ is minimized. That
formulation can be considered as a special case of an output-feedback, risk sensitive
optimal control problem in which the estimate $e_t$ has the role of a minimizing control. See
Boel-James-Petersen [BJP], and Moore-Elliott-Dey [MED], and references cited therein.
In that formulation, the optimal risk sensitive estimator involves the solution to an
infinite-dimensional PDE for the dynamics of the information state. By considering the exponential
terminal cost criterion (3.26), the analysis remains at the level of finite-dimensional PDEs.

In order to compare Risk-Sensitive Filtering and Robust Filtering, we must suppose that
they view the same particular measurement path. Consequently, we let $Z_\cdot$ be the particular
observation path that we see. If this is to be a path that could be viewed by the Robust
Filter, then it must be differentiable (by the assumption of finite energy in the noise process in
Robust Filtering, which is a reasonable assumption). Let $z(t) = \dot{Z}_t$ be the time-derivative of this
given observation path. Then $V^\varepsilon$ satisfies the following PDE. For simplicity, we assume that
$\sigma$ is the identity, and that $\rho$ is simply a scalar.

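$$V^\varepsilon_T = -f\cdot\nabla V^\varepsilon - \tfrac{1}{2}|\nabla V^\varepsilon|^2 + \Big(\frac{g}{2\rho^2} - \frac{z(T)}{\rho^2}\Big)\cdot g + \frac{\varepsilon}{2}\Delta V^\varepsilon + \varepsilon\,\nabla\cdot f \tag{3.27}$$

$$V^\varepsilon(0,x) = p_0(x).$$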

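As $\varepsilon \downarrow 0$, $V^\varepsilon$ tends formally to a limit $V^0$ which should satisfy (in a viscosity sense) the
corresponding PDE with $\varepsilon = 0$

$$\begin{aligned}
0 &= -V^0_T - f\cdot\nabla V^0 - \tfrac{1}{2}|\nabla V^0|^2 - \frac{1}{\rho^2}z(T)\cdot g + \frac{1}{2\rho^2}|g|^2 \\
  &= -V^0_T + \inf_{w\in\mathbb{R}^n}\Big[(-f - w)\cdot\nabla V^0 + \tfrac{1}{2}|w|^2\Big] + \frac{1}{2\rho^2}|z(T) - g|^2 - \frac{1}{2\rho^2}|z(T)|^2
\end{aligned} \tag{3.28}$$

$$V^0(0,x) = p_0(x).$$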

Note that by replacing $W$ in (3.8) with $-V^0$ (and multiplying both sides by $-1$) and
subtracting the last term in (3.28), one obtains (3.28) from (3.8). This suggests the following
result, which we can prove by the method of viscosity solutions. As $\varepsilon \downarrow 0$, $V^\varepsilon \to V^0$ uniformly
on compact sets, where

$$-V^0(T,x) = W(T,x) - \frac{1}{2\rho^2}\int_0^T |z(t)|^2\,dt.$$

That is, the $-V^\varepsilon$ converge to the information state of the Robust Filter minus an additional
term which is independent of $x$. Consequently, the Risk-Sensitive estimate $\hat{e}^\varepsilon_T$ converges to
the robust estimate $\hat{e}_T$ as $\varepsilon \downarrow 0$. Thus, the Robust Filter described in the previous section is
the limit of a stochastic filter.


4.  References 

[AA]  R.  Adomaitis  and  E.  H.  Abed,  “Local  nonlinear  control  of  stall  inception  in  axial  flow 
compressors”,  AIAA/SAE/ASME/ASEE  29th  Joint  Propulsion  Conference,  (1993). 

[AHH]  E.  H.  Abed,  P.K.  Houpt  and  W.  M.  Hosny,  “Bifurcation  analysis  of  surge  and  rotating 
stall  in  axial  flow  compressors”,  Journal  of  Turbomachinery,  115  (1993)  817-824. 

[AL]  E.  H.  Abed  and  D.-C.  Liaw,  “Stability  analysis  and  control  of  rotating  stall”,  Proceedings 
of  2nd  IFAC  Nonlinear  Control  Systems  Design  Symposium,  Bordeaux  (1992). 

[B]  J.  Baillieul,  “The  nonlinear  control  theory  of  complex  mechanical  systems”,  Proc.  1995 
AFOSR  Workshop  on  Dynamics  and  Control,  Minneapolis  (1995),  17-20. 

[BH]  J. A. Ball and J. W. Helton, “Viscosity solutions of Hamilton-Jacobi equations arising
in nonlinear H∞ control”, SIAM J. Control and Optim.

[BHW]  J. A. Ball, J. W. Helton and M. L. Walker, “H∞ control for nonlinear systems with
output feedback”, IEEE Trans. Auto. Control, 38 (1993) 117-164.

[BJP]  R. K. Boel, M. R. James and I. R. Petersen, preprint.

[BS]  M. Bardi and P. Soravia, “A comparison result for Hamilton-Jacobi equations and
applications to some differential games lacking controllability”, Funkcialaj Ekvacioj, 37
(1994) 19-43.




[BWGH]  W. D. Blair, G. A. Watson, G. L. Gentry and S. A. Hoffman, “Benchmark problem for
beam pointing control of phased array radar against maneuvering targets in the presence
of ECM and false alarms”, Proc. ACC, (1995).

[CS]  P. Cannarsa and C. Sinestrari, “Convexity properties of the minimum time function”,
Preprint (1994).

[Day]  M.  V.  Day,  “On  Lagrange  manifolds  and  viscosity  solutions”,  Preprint  (1997). 

[DJ]  P.  M.  Dower  and  M.  R.  James,  “Dissipativity  and  nonlinear  systems  with  finite  power 
gain”,  Submitted  to  Int.  J.  Robust  and  Nonlinear  Control  (1995). 

[DK]  P. Dupuis and H. Kushner, “Minimizing escape probabilities: a large deviations
approach”, SIAM J. Control and Optim., 27 (1989) 432-445.

[DMc]  P. Dupuis and W. M. McEneaney, “Risk-sensitive and robust escape criteria”, SIAM J.
Control and Optim., 35 (1997) 2021-2049.

[DZ]  A. Dembo and O. Zeitouni, “Large deviations techniques and applications”, Jones and
Bartlett Publishers.

[EK]  R.  J.  Elliott  and  N.  J.  Kalton,  “The  existence  of  value  in  differential  games”,  Memoirs 
of  the  Amer.  Math.  Society,  126  (1972). 

[ES]  L. C. Evans and P. E. Souganidis, “Differential games and representation formulas for
solutions of Hamilton-Jacobi-Isaacs equations”, Indiana Univ. Math. J., 33 (1984)
773-797.

[FF1]  M. Falcone and R. Ferretti, “A class of fully discrete high-order schemes for advection
equations”, Preprint (1994).

[FF2]  M. Falcone and R. Ferretti, “Discrete time high-order schemes for viscosity solutions of
Hamilton-Jacobi-Bellman equations”, Numerische Mathematik, Springer-Verlag (1993).

[FGM]  E. Fernandez-Gaucherand and S. I. Marcus, “Risk-sensitive optimal control of hidden
Markov models: a case study”, Proceedings 33rd IEEE CDC, Orlando, (1994) 1657-1662.

[FHH]  W. H. Fleming and D. Hernandez-Hernandez, “Risk-sensitive control of finite state
machines on an infinite horizon I”, Submitted to SIAM J. Control and Optim.

[FMcf]  W.  H.  Fleming  and  W.  M.  McEneaney,  “Risk  sensitive  and  robust  nonlinear  filtering”, 
Submitted  to  36th  IEEE  Conf.  on  Decision  and  Control. 

[FMci]  W.  H.  Fleming  and  W.  M.  McEneaney,  “Risk  sensitive  control  with  ergodic  cost  criteria”, 
Proc.  31st  IEEE  Conf.  on  Decision  and  Control,  1992. 

[FMcr]  W. H. Fleming and W. M. McEneaney, “Risk sensitive optimal control and differential
games”, Springer Lecture Notes in Control and Info. Sci. No. 184, 1992, 185-197.

[FMcs]  W. H. Fleming and W. M. McEneaney, “Risk sensitive control on an infinite time
horizon”, SIAM J. Control and Optim., 33 (1995) 1881-1915.

[FS]  W. H. Fleming and H. M. Soner, “Controlled Markov Processes and Viscosity Solutions”,
Springer-Verlag, 1992.

[FSo]  W. H. Fleming and P. E. Souganidis, “PDE-Viscosity Solution Approach to Some
Problems of Large Deviations”, Annali Scuola Normale Superiore - Pisa, Serie 4, Vol 13,
(1986).

[FT]  W. H. Fleming and C.-P. Tsai, “Optimal Exit Probabilities and Differential Games”,
Applied Mathematics and Optimization, 7 (1981) 253-282.

[H]  O. J. Hijab, Minimum Energy Estimation, University of California, Berkeley, PhD
Thesis, 1980.

[I]  A. Isidori, “H∞ control via measurement feedback for affine nonlinear systems”, Proc.
31st IEEE Conf. on Dec. and Control, Dec. 1992.

[Ja]  M. R. James, “Asymptotic analysis of nonlinear stochastic risk-sensitive control and
differential games”, Math. of Control, Signals and Systems, 5 (1992) 401-417.

[JB]  M. R. James and J. S. Baras, “Partially observed differential games, infinite dimensional
HJI equations, and nonlinear H∞ control”, SIAM J. Control and Optim.

[JBE]  M. R. James, J. S. Baras and R. J. Elliott, “Output feedback risk-sensitive control and
differential games for continuous-time nonlinear systems”, 32nd IEEE CDC (1993).

[Jc]  M. R. James, “Computing the H∞ norm for nonlinear systems”, Proceedings 12th IFAC
World Congress, 1993.

[Jp]  M. R. James, “A partial differential inequality for dissipative nonlinear systems”,
Systems and Control Letters, 21 (1993) 315-320.

[KD]  A. J. Krener and A. Duarte, “A Hybrid Computational Approach to Nonlinear
Estimation”, 35th IEEE CDC, (1996).

[LRR]  S. V. Lototsky, C. Rao and B. L. Rozovskii, “Fast nonlinear filter for continuous-discrete
time multiple models”, Preprint.

[Mca]  W. M. McEneaney, “Optimal aeroassisted guidance using Loh’s term approximations”,
J. Guidance, Control and Dynamics, 14 (1991) 368-376.

[McD]  W.  M.  McEneaney  and  P.  Dupuis,  “A  Risk-Sensitive  Escape  Criterion  and  Robust 
Limit”,  Proceedings  33rd  IEEE  CDC,  1994. 

[McDa]  W. M. McEneaney and M. V. Day, “Characteristic characterization of viscosity
supersolutions corresponding to H∞ control”, Proceedings IFAC 13th World Congress, Vol. E,
(1996), 401-406.

[Mcel]  W. M. McEneaney, “Elimination of troublesome disturbances with application to
representation results for H∞ control DPEs”, Proc. 7th International Symposium on Dynamic
Games, Shonan Village, Japan (1996), 662-671.

[McM1]  W. M. McEneaney and K. D. Mease, “Nonlinear H∞ control of aerospace plane ascent”,
34th IEEE CDC (1995), 3994-3995.

[McM2]  W. M. McEneaney and K. D. Mease, “Error analysis for a guided Mars landing”, J.
Astronautics, 39 (1991) 423-445.

[Mcr]  W. M. McEneaney, “Robust control and differential games on a finite time horizon”,
Math. Control, Signals and Systems, 8 (1995), 138-166.

[Mcrf]  W. M. McEneaney, “Robust/H∞ filtering for nonlinear systems”, Submitted to Systems
and Control Letters.

[Mct]  W. M. McEneaney, “Connections between risk-sensitive stochastic control, differential
games and H∞ control: the nonlinear case”, Brown University PhD Thesis, 1993.

[Mcu]  W.  M.  McEneaney,  “Uniqueness  for  viscosity  solutions  of  nonstationary  HJB  equations 
under  some  a  priori  conditions  (with  applications)”,  SIAM  J.  Control  and  Optim.,  33 
(1995),  1560-1576. 

[MED]  J.  B.  Moore,  R.  J.  Elliott  and  S.  Dey,  “Risk  sensitive  generalizations  of  minimum  variance 
estimation  and  control”,  J.  Math.  Systems,  Estimation  and  Control. 

[MG]  F.  K.  Moore  and  E.  M.  Greitzer,  “A  theory  of  post-stall  transients  in  axial  compression 
systems:  part  1  -  development  of  equations”,  J.  of  Engineering  for  Gas  Turbines  and 
Power,  108  (1986)  68-76. 

[Mi]  S. Mirica, “Extending Cauchy’s method of characteristics for Hamilton-Jacobi
equations”, Studi si Cercetari Mat., 37 (1985) 555-565.

[MK]  K.  D.  Mease  and  J.-P.  Kremer,  “Altitude — path  angle  control  during  aerospace  plane 
ascent”,  AIAA,  (1994). 

[Mo]  R.  E.  Mortensen,  Maximum  Likelihood  Recursive  Nonlinear  Filtering,  J.  Optim.  Theory 
Appl.,  2  (1968)  386-394. 

[MVBl]  K. D. Mease and M. A. Van Buren, “Aerospace plane guidance using time-scale
decomposition and feedback linearization”, J. of Guidance, Control and Dynamics, 15 (1992)
1166-1174.

[R]  T. Runolfsson, “Stationary risk-sensitive LQG control and its relation to LQG and H∞
control”, Proc. 29th IEEE CDC, Dec. 1990, 1018-1023.




[SFJ]  J. L. Speyer, C.-H. Fan and C. R. Jaensch, “Centralized and decentralized solutions of
the linear-quadratic-exponential-gaussian problem”, Technical paper, Dept. of Mech.,
Aerospace and Nuclear Engineering, UCLA, Los Angeles (1990).

[So]  P. Soravia, “H∞ control of nonlinear systems: differential games and viscosity solutions”,
Submitted to SIAM J. Control and Optimization.

[Sua]  N. N. Subbotina, “Necessary and sufficient optimality conditions in terms of
characteristics of the Hamilton-Jacobi-Bellman equation”, Rept. 393, Institut für Angewandte
Mathematik und Statistik, Universität Würzburg (1992).

[vdS]  A. J. van der Schaft, “Nonlinear state space H∞ control theory”, Perspectives in Control
(from series Progress in Systems and Control), Birkhäuser, Boston 1993.

[W]  P.  Whittle,  “Risk  sensitive  optimal  control”,  Wiley,  1990. 




